GenPlan: Generative Sequence Models as Adaptive Planners

Akash Karthikeyan · Yash Vardhan Pant

University of Waterloo

Accepted in AAAI 2025

Abstract

Sequence models have demonstrated remarkable success in behavioral planning by leveraging previously collected demonstrations. However, solving multi-task missions remains a significant challenge, particularly when the planner must adapt to unseen constraints and tasks, such as discovering goals and unlocking doors. Such behavioral planning problems are challenging to solve due to: a) agents failing to adapt beyond the single task learned through their reward function, and b) inability to generalize to new environments, e.g., those with walls and locked doors, when trained only in planar environments. Consequently, state-of-the-art decision-making methods are limited to missions where the required tasks are well-represented in the training demonstrations and can be solved within a short (temporal) planning horizon. To address this, we propose \model: a stochastic and adaptive planner that leverages discrete-flow models for generative sequence modeling, enabling sample-efficient exploration and exploitation. This framework relies on an iterative denoising procedure to generate a sequence of goals and actions. This approach captures multi-modal action distributions and facilitates goal and task discovery, thereby generalizing to out-of-distribution tasks and environments, i.e., missions not part of the training data. We demonstrate the effectiveness of our method through multiple simulation environments. Notably, \model outperforms state-of-the-art methods by over $10%$ on adaptive planning tasks, where the agent adapts to multi-task missions while leveraging demonstrations from single-goal-reaching tasks.

Install

We recommed installing using conda

git clone https://github.com/CL2-UWaterloo/GenPlan.git
cd GenPlan
env create -f env.yaml
conda activate plan

Installing the MiniGrid Environment

git clone https://github.com/Aku02/Minigrid.git
cd Minigrid
pip install -e .

See the MiniGrid repository for information on how to get

Usage

Refer to the configuration files in the config directory. We provide example for 2 envs

Activate conda environment and login to wandb (if you haven't already).

WANDB_MODE='online' python train_genplan.py

Contact

In case of any issues, please email: akashuwtce@gmail.com

Acknowledgement

We thank the authors for sharing the code!

Citation

@misc{karthikeyan2024genplangenerativesequencemodels,
      title={GenPlan: Generative sequence models as adaptive planners}, 
      author={Akash Karthikeyan and Yash Vardhan Pant},
      year={2024},
      eprint={2412.08565},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2412.08565}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
assets		assets
babyai/demos		babyai/demos
config		config
mingpt		mingpt
.gitignore		.gitignore
README.md		README.md
env.yaml		env.yaml
train_genplan.py		train_genplan.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GenPlan: Generative Sequence Models as Adaptive Planners

Abstract

Install

Usage

Contact

Acknowledgement

Citation

About

Uh oh!

Releases

Packages

Languages

CL2-UWaterloo/GenPlan

Folders and files

Latest commit

History

Repository files navigation

GenPlan: Generative Sequence Models as Adaptive Planners

Abstract

Install

Usage

Contact

Acknowledgement

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages