Scikit-decide

AI framework for Reinforcement Learning, Automated Planning and Scheduling

Get Started →

Problem solving

Describe your decision-making problem once and auto-match compatible solvers.

Growing catalog

Enjoy a growing list of domains & solvers catalog, supported by the community.

Open & Extensible

Scikit-decide is open source and is able to wrap existing state-of-the-art domains/solvers.

# Welcome to scikit-decide

# What is it?

Scikit-decide is an AI framework for Reinforcement Learning, Automated Planning and Scheduling.

This framework was initiated at Airbus (opens new window) AI Research and notably received contributions through the ANITI (opens new window) and TUPLES (opens new window) projects, and also from ANU (opens new window).

TIP

Please refer to the Guide and Reference sections at the top to learn how to use scikit-decide.

# Main features

  • Problem solving: describe your decision-making problem once and auto-match compatible solvers.
    For instance planning/scheduling problems can be solved by RL solvers using GNNs.
  • Growing catalog: enjoy a growing list of domains & solvers catalog, supported by the community.
  • Open & Extensible: scikit-decide is open source and is able to wrap existing state-of-the-art domains/solvers.
  • Domains available:
  • Solvers available:
    • RL solvers from ray.rllib and stable-baselines3
      • existing algos with action masking
      • adaptation of RL algos for graph observation, based on GNNs from pytorch-geometric (opens new window) 🆕
      • autoregressive models with action masking component by component for parametric actions 🆕
    • Planning solvers from unified-planning (opens new window) library
    • RDDL solvers jax and gurobi-based based on pyRDDLGym-jax and pyRDDLGym-gurobi from pyrddl-gym project (opens new window)
    • Search solvers coded in scikit-decide library:
      • A*, AO*, Improved-LAO*
      • Value Iteration (VI), Policy Iteration (PI)
      • Labeled RTDP, Learning Real-Time A*
      • LDFS (Label-correcting Depth-First Search), Iterative Deepening A*
      • SSiPP (Short-Sighted Planning), FRET (Find, Revise, Eliminate Traps)
      • iDual (LP-based SSP solver), Goal Probability and Cost Iteration (GPCI)
      • Best First Width Search, Iterated Width (IW), Rollout IW (RIW)
      • Monte Carlo Tree Search (MCTS), POMCP
      • DESPOT, SARSOP, Witness (POMDP solvers)
      • RTDP-Bel (belief-space RTDP), HSVI / GoalHSVI
      • SSPReplan, SSPDetHindsight, SSPPlanMerger (determinization approaches)
      • Multi-Agent RTDP, Multi-Agent Heuristic meta-solver (MAHD)
    • (Probabilistic) PDDL (PPDDL) solvers:
      • FF planner
      • FFReplan / PPDDLReplan (replanning with pluggable inner solvers)
      • FFDetHindsight / PPDDLDetHindsight (determinization in hindsight)
      • RFF / PPDDLPlanMerger (plan aggregation into a policy)
    • PDDL heuristics (with their probabilistic extensions):
      • Delete-Relaxation heuristics
      • FF Heuristic
    • PDDL+ parser and simulators with Probabilistic PDDL extensions
    • Evolution strategy: Cartesian Genetic Programming (CGP)
    • Scheduling solvers from discrete-optimization (opens new window),
  • Tuning solvers hyperparameters
    • hyperparameters definition
    • automated study with optuna