Paper Feed ================ Must-read papers on LLM-driven automated planning specification. New Papers! ------------------------------------------------------- + **"Make Planning Research Rigorous Again!"** Katz et al. (2025) `[paper] `__ + **"Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM"** Attolino et al. (2025) `[paper] `__ `[code] `__ + **"LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution"** Kienle et al. (2025) `[paper] `__ `[code] `__ + **"Large Language Models for Planning: A Comprehensive and Systematic Survey"** Cao et al. (2025) `[paper] `__ `[code] `__ + **"Text2World: Benchmarking Large Language Models for Symbolic World Model Generation"** Hu et al. (2025) `[paper] `__ `[code] `__ Paper List ------------------------------------------------------- This section presents a taxonomy of research within Model Construction, organized into three broad categories: Model Generation, Model Editing, and Model Benchmarks. Within each category, the most recent contributions are listed first (then alphabetically). **Model Generation** ~~~~~~~~~~~~~~~~~~~~ **Task Modeling** .. raw:: html
.. list-table:: * - **"Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation"** Wang et al. (2025) - `[paper] `__ `[code] `__ * - **"TIC: Translate-Infer-Compile for accurate 'text to plan' using LLMs and logical intermediate representations"** Agarwal and Sreepathy (2024) - `[paper] `__ * - **"AutoGPT+P: Affordance-based Task Planning with Large Language Models"** Birr et al. (2024) - `[paper] `__ `[code] `__ * - **"Leveraging LLMs for Generating Document-Informed Hierarchical Planning Models: A Proposal"** Fine-Morris et al. (2024) - `[paper] `__ * - **"A Demonstration of Natural Language Understanding in Embodied Planning Agents"** Grover and Mohan (2024) - `[paper] `__ * - **"CaStL: Constraints as Specifications through LLM Translation for Long-Horizon Task and Motion Planning"** Guo et al. (2024) - `[paper] `__ * - **"PlanCollabNL: Leveraging Large Language Models for Adaptive Plan Generation in Human-Robot Collaboration"** Izquierdo-Badiola et al. (2024) - `[paper] `__ * - **"Enabling Semantic Reasoning in Robots through Natural Language Processing"** Kalland (2024) - `[paper] `__ * - **"Thought of Search: Planning with Language Models Through The Lens of Efficiency"** Katz et al. (2024) - `[paper] `__ * - **"Fast and Accurate Task Planning using Neuro-Symbolic Language Models and Multi-level Goal Decomposition"** Kwon et al. (2024) - `[paper] `__ `[code] `__ * - **"Planning AI Assistant for Emergency Decision-Making (PlanAID): Framing Planning Problems and Assessing Plans with Large Language Models"** Lee et al. (2024) - `[paper] `__ * - **"Safe Planner: Empowering Safety Awareness in Large Pre-Trained Models for Robot Task Planning"** Li et al. (2024) - `[paper] `__ * - **"Towards Human Awareness in Robot Task Planning with Large Language Models"** Liu et al. (2024) - `[paper] `__ * - **"LLM Reasoner and Automated Planner: A New NPC Approach"** Merino and Sabater-Mir (2024) - `[paper] `__ * - **"Bootstrapping Object-level Planning with Large Language Models"** Paulius et al. (2024) - `[paper] `__ * - **"TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners"** Rosa et al. (2024) - `[paper] `__ * - **"TwoStep: Multi-agent Task Planning using Classical Planners and Large Language Models"** Singh et al. (2024) - `[paper] `__ `[code] `__ * - **"Anticipate & Collab: Data-driven Task Anticipation and Knowledge-driven Planning for Human-robot Collaboration"** Singh et al. (2024) - `[paper] `__ `[code] `__ * - **"PDDLEGO: Iterative Planning in Textual Environments"** Zhang et al. (2024) - `[paper] `__ `[code] `__ * - **"LaMMA-P: Generalizable Multi-Agent Long-Horizon Task Allocation and Planning with LM-Driven PDDL Planner"** Zhang et al. (2024) - `[paper] `__ `[code] `__ * - **"LGMCTS: Language-Guided Monte-Carlo Tree Search for Executable Semantic Object Rearrangement"** Chang et al. (2023) - `[paper] `__ `[code] `__ * - **"AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers"** Chen et al. (2023) - `[paper] `__ `[code] `__ * - **"Dynamic Planning with a LLM"** Dagan et al. (2023) - `[paper] `__ `[code] `__ * - **"Task and Motion Planning with Large Language Models for Object Rearrangement"** Ding et al. (2023) - `[paper] `__ `[code] `__ * - **"LLM+P: Empowering Large Language Models with Optimal Planning Proficiency"** Liu et al. (2023) - `[paper] `__ `[code] `__ * - **"Faithful Chain-of-Thought Reasoning"** Lyu et al. (2023) - `[paper] `__ `[code] `__ * - **"Vision-Language Interpreter for Robot Task Planning"** Shirai et al. (2023) - `[paper] `__ `[code] `__ * - **"Translating natural language to planning goals with large-language models"** Xie et al. (2023) - `[paper] `__ `[code] `__ * - **"Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behaviour in out-of-distribution reasoning tasks"** Collins et al. (2022) - `[paper] `__ `[code] `__ .. raw:: html
**Domain Modeling** .. raw:: html
.. list-table:: * - **"Predicate Invention from Pixels via Pretrained Vision-Language Models"** Athalye et al. (2024) - `[paper] `__ * - **"Language-Augmented Symbolic Planner for Open-World Task Planning"** Chen at al. (2024) - `[paper] `__ * - **"Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts"** Huang et al. (2024) - `[paper] `__ `[code] `__ * - **"Learning Compositional Behaviors from Demonstration and Language"** Liu et al. (2024) - `[paper] `__ * - **"Using Large Language Models to Extract Planning Knowledge from Common Vulnerabilities and Exposures"** Oates et al. (2024) - `[paper] `__ `[code] `__ * - **"Large Language Models as Planning Domain Generators"** Oswald et al. (2024) - `[paper] `__ `[code] `__ * - **"Autonomously Learning World-Model Representations For Efficient Robot Planning"** Shah (2024) - `[paper] `__ * - **"Creating PDDL Models from Javascript using LLMs: Preliminary Results"** Sikes et al. (2024) - `[paper] `__ * - **"Leveraging LLMs for HTN domain model generation via prompt engineering"** Sinha (2024) - `[paper] `__ * - **"Making Large Language Models into World Models with Precondition and Effect Knowledge"** Xie at al. (2024) - `[paper] `__ * - **"PROC2PDDL: Open-Domain Planning Representations from Texts"** Zhang et al. (2024) - `[paper] `__ `[code] `__ * - **"Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds"** Ding et al. (2023) - `[paper] `__ `[code] `__ * - **"Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning"** Guan et al. (2023) - `[paper] `__ `[code] `__ * - **"Learning adaptive planning representations with natural language guidance"** Wong et al. (2023) - `[paper] `__ .. raw:: html
**Hybrid Modeling** .. raw:: html
.. list-table:: * - **"NL2Plan: Robust LLM-Driven Planning from Minimal Text Descriptions"** Gestrin et al. (2024) - `[paper] `__ `[code] `__ * - **"InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning"** Han et al. (2024) - `[paper] `__ `[code] `__ * - **"Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming"** Hao et al. (2024) - `[paper] `__ `[code] `__ * - **"AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation"** Hu et al. (2024) - `[paper] `__ `[code] `__ * - **"DELTA: Decomposed Efficient Long-Term Robot Task Planning using Large Language Models"** Liu et al. (2024) - `[paper] `__ * - **"Leveraging Environment Interaction for Automated PDDL Generation and Planning with Large Language Models"** Mahdavi et al. (2024) - `[paper] `__ * - **"Consolidating Trees of Robotic Plans Generated Using Large Language Models to Improve Reliability"** Sakib and Sun (2024) - `[paper] `__ * - **"Toward a Method to Generate Capability Ontologies from Natural Language Descriptions"** Silva et al. (2024) - `[paper] `__ * - **"Generating consistent PDDL domains with Large Language Models"** Smirnov et al. (2024) - `[paper] `__ * - **"MORPHeus: a Multimodal One-armed Robot-assisted Peeling System with Human Users In-the-loop"** Ye et al. (2024) - `[paper] `__ `[code] `__ * - **"There and Back Again: Extracting Formal Domains for Controllable Neurosymbolic Story Authoring"** Kelly et al. (2023) - `[paper] `__ `[code] `__ * - **"The Neuro-Symbolic Inverse Planning Engine (NIPE): Modeling Probabilistic Social Inferences from Linguistic Inputs"** Ying et al. (2023) - `[paper] `__ * - **"ISR-LLM: Iterative Self-Refined Large Language Model for Long-Horizon Sequential Task Planning"** Zhou et al. (2023) - `[paper] `__ `[code] `__ .. raw:: html
---- **Model Editing** ~~~~~~~~~~~~~~~~~ .. raw:: html
.. list-table:: * - **"Can LLMs Fix Issues with Reasoning Models? Towards More Likely Models for AI Planning"** Caglar et al. (2024) - `[paper] `__ * - **"LLMs for AI Planning: A Study on Error Detection and Correction in PDDL Domain Models"** Patil (2024) - `[paper] `__ * - **"Traversing the Linguistic Divide: Aligning Semantically Equivalent Fluents Through Model Refinement"** Sikes et al. (2024) - `[paper] `__ * - **"Exploring the limitations of using large language models to fix planning tasks"** Gragera and Pozanco (2023) - `[paper] `__ .. raw:: html
---- **Model Benchmarks** ~~~~~~~~~~~~~~~~~~~~ **LLMs-as-Planners** .. raw:: html
.. list-table:: * - **"A Roadmap to Guide the Integration of LLMs in Hierarchical Planning"** Puerta-Merino et al. (2025) - `[paper] `__ `[code] `__ * - **"Exploring and Benchmarking the Planning Capabilities of Large Language Models"** Bohnet et al. (2024) - `[paper] `__ * - **"ACPBench: Reasoning about Action, Change, and Planning"** Kokel et al. (2024) - `[paper] `__ `[code] `__ * - **"TravelPlanner: A Benchmark for Real-World Planning with Language Agents"** Xie et al. (2024) - `[paper] `__ `[code] `__ * - **"NATURAL PLAN: Benchmarking LLMs on Natural Language Planning"** Zheng et al. (2024) - `[paper] `__ `[code] `__ * - **"Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning"** (Household) Guan et al. (2023) - `[paper] `__ `[code] `__ * - **"Automating the Generation of Prompts for LLM-based Action Choice in PDDL Planning"** Stein et al. (2023) - `[paper] `__ `[code] `__ * - **"ON THE PLANNING ABILITIES OF LARGE LANGUAGE MODELS (A CRITICAL INVESTIGATION WITH A PROPOSED BENCHMARK)"** Valmeekam et al. (2023) - `[paper] `__ `[code] `__ * - **"PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change"** Valmeekam et al. (2023) - `[paper] `__ `[code] `__ * - **"ALFWorld: Aligning Text and Embodied Environments for Interactive Learning"** Shridhar et al. (2021) - `[paper] `__ `[code] `__ .. raw:: html
**LLMs-as-Formalizers PDDL Benchmarks** .. raw:: html
.. list-table:: * - **"Text2World: Benchmarking Large Language Models for Symbolic World Model Generation"** Hu et al. (2025) - `[paper] `__ `[code] `__ * - **"Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages"** Zuo et al. (2024) - `[paper] `__ `[code] `__ .. raw:: html
The following is the core summary of model generation frameworks in `"LLMs as Planning Formalizers: A Survey for Leveraging Large Language Models to Construct Automated Planning Models" `_: .. image:: _static/survey_table.png :alt: no image available