About The Workshop
This workshop explores the growing capabilities of large language models (LLMs), such as OpenAI's o1 model, in reasoning, planning, and decision-making, highlighting recent advances and challenges. We aim to examine how reinforcement learning methods, post-training optimization, and efficient inference techniques can further enhance LLMs' reasoning capabilities. Topics include training approaches for enhancing reasoning and planning abilities, scaling inference for complex tasks, developing robust benchmarks, and extending LLMs to multi-modal and embodied environments. We will also discuss broader themes such as causal reasoning, collaborative multi-agent systems, uncertainty, and explainability to offer insights and guidance for the further development of reasoning and planning in LLMs.
Schedule
| Time (SGT) | Session | Speaker | Talk Title | 
|---|---|---|---|
| 08:30 – 08:40 | Introduction and Opening Remarks | ||
| 08:40 – 09:10 | Invited Talk 1 | Yuandong Tian (Meta) | Reason by Search or by Representation? A Path Towards Unifying Neural and Symbolic Decision Making | 
| 09:10 – 09:40 | Invited Talk 2 | Guy Van den Broeck (UCLA) | Symbolic Reasoning about Large Language Models | 
| 09:40 – 09:50 | Tea Break | ||
| 09:50 – 10:20 | Invited Talk 3 | Yarin Gal (Oxford) | AI models collapse when trained on recursively generated data | 
| 10:20 – 10:50 | Invited Talk 4 | Natasha Jaques (UW & Google DeepMind) | Social Reasoning for Large Language Models | 
| 10:50 – 11:50 | Panel Discussion | All Speakers (Yuandong, Junxian, Yarin, Bo) | |
| 12:00 – 13:30 | Poster Session 1 and Lunch Break | ||
| 13:45 – 14:15 | Invited Talk 5 | Stephen McAleer (OpenAI) | Toward Capable and Safe Virtual Agents | 
| 14:15 – 14:25 | Oral Paper 1 | Yuyang Wu | When More is Less: Understanding Chain-of-Thought Length in LLMs | 
| 14:25 – 14:35 | Oral Paper 2 | Yu-Ting Lee & Hui-Ting Shih | RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner | 
| 14:35 – 14:45 | Oral Paper 3 | Harshita Chopra | Feedback-Aware Monte Carlo Tree Search for Efficient Information Seeking in Goal-Oriented Conversations | 
| 14:45 – 14:55 | Oral Paper 4 | Hanze Dong | Offline Reinforcement Learning for LLM Multi-Step Reasoning | 
| 14:55 – 16:00 | Poster Session 2 | ||
| 16:00 – 16:30 | Invited Talk 6 | Bo An (NTU) | From Algorithmic and RL-based to LLM-powered Agents | 
| 16:30 – 17:00 | Invited Talk 7 | Junxian He (HKUST) | Taming Reinforcement Learning for Effective and Efficient Reasoners | 
| 17:00 – 17:10 | Oral Paper 5 | Yunchao Hao | Rethinking Fine-tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning | 
| 17:10 – 17:20 | Oral Paper 6 | Anikait Singh | Improving Test-Time Search for LLMs with Backtracking Against In-Context Value Verifiers | 
| 17:20 – 17:30 | Oral Paper 7 | Niklas Muennighoff | s1: Simple Test-Time Scaling | 
| 17:30 – 17:40 | Paper Award & Closing Remarks | ||
Topics
The workshop will cover a range of topics, including but not limited to:
We will explore the application of RL algorithms and other effective approaches in enhancing LLM reasoning and planning abilities during both pre-training and post-training stages. We will examine how techniques like Reinforcement Learning from Human Feedback (RLHF) can be adapted and expanded for efficient reasoning. Key questions include:
- How can RL and other effective methods be utilized in pre-training to improve reasoning abilities?
- What post-training approaches (e.g., fine-tuning, RLHF) are most effective for LLM planning tasks?
- How can synthetic data generation and self-supervised training enhance LLM reasoning and planning?
We will discuss challenges and innovations in scaling up reasoning during inference. As models become larger and tasks more complex, efficient inference mechanisms are critical. Topics of interest include:
- What are the most promising methods for scaling inference-time compute in reasoning-heavy tasks?
- How can models dynamically allocate resources during inference to optimize for reasoning and planning?
Developing robust benchmarks for evaluating reasoning and planning in LLMs is critical to track progress. This session will address the need for new metrics and standardized tasks to assess reasoning abilities across different scenarios. Key discussions will include:
- What benchmarks can accurately reflect the reasoning and planning capabilities of LLMs?
- How do we design tasks that evaluate long-horizon reasoning and complex decision-making?
As LLMs increasingly integrate with multi-modal environments, reasoning across multiple data types (e.g., vision, sound, text) becomes more essential. This session will explore the application of reasoning and planning in multi-modality and embodied AI systems, including robotics and real-world interactions:
- How can LLMs enhance multi-modal reasoning and planning to better interact with diverse environments?
- What are the key challenges and opportunities in applying LLMs to multi-modal tasks, including those requiring embodied reasoning?
In addition to the core themes mentioned above, our discussions will also encompass a broader range of emerging topics, including:
- Causal Reasoning: How can LLMs move beyond pattern recognition to infer causal relationships?
- Collaborative Reasoning in Multi-Agent Systems: How can LLMs enable multi-agent cooperation for distributed tasks?
- Uncertainty and Robustness: How can LLMs improve reasoning under ambiguous information?
- Human-in-the-Loop Systems: How can human feedback refine LLM decision-making processes?
- Explainability: How can we make LLM reasoning and planning more transparent and interpretable for real-world applications?
Call For Papers
The Reasoning and Planning for LLMs workshop at ICLR 2025 invites submissions on the development of novel architectures, algorithms, theoretical analyses, empirical studies, and applications in reasoning and planning with LLMs. Submissions must present original, unpublished research.
Key Dates
- Paper Deadline: February 6, 2025 (AoE), extended from February 2, 2025 (AoE)
- Notification: March 5, 2025 (AoE)
- Camera-ready: March 19, 2025
Submission Site
Submissions will be managed via OpenReview. Papers will remain private during the review process. All authors must maintain up-to-date OpenReview profiles to ensure proper conflict-of-interest management and paper matching. Incomplete profiles may result in desk rejection.
Learn how to create an OpenReview profile here.
Submit papers through the Reasoning and Planning for LLMs Workshop Submission Portal on OpenReview.
Scope
We welcome contributions across a broad spectrum of topics, including but not limited to:
- Training methodologies for enhancing reasoning and planning in LLMs
- Efficient inference for complex reasoning tasks
- Benchmarking reasoning and planning capabilities
- Multi-modality and embodiment in LLMs
- Emerging trends in LLM reasoning and planning
Submission Guidelines
Formatting Requirements
Submissions must be in English and follow the Reasoning and Planning for LLMs Workshop LaTeX Template (adapted from the ICLR 2025 template).
Papers must be submitted as a single PDF file:
- Long Papers: at most 9 pages (main text)
- Tiny Papers: between 2 and 4 pages (main text)
- References and appendices are not included in the page limit, but the main text must be self-contained. Reviewers are not required to read beyond the main text.
Submissions exceeding the page limit will be desk rejected.
Anonymity
The workshop follows a double-blind review process. Submissions must be anonymized by removing author names, affiliations, and acknowledgments. Prior work should be cited in the third person. Identifying information, including in supplementary materials, must be omitted.
Dual Submission and Non-Archival Policy
Submissions under review at other venues will be accepted, provided they do not breach any dual-submission or anonymity policies of those venues. Submissions will not be indexed or have archival proceedings. We welcome ICML 2025 and ACL 2025 submissions.
Transparency
By submitting to the Reasoning and Planning for LLMs Workshop, authors agree that for all accepted papers, the original submission, reviews, and meta-reviews will be made publicly available on OpenReview.
Contact
Email: zhiyuanhucs@gmail.com
Accepted Paper
Student Registration Grant
We are excited to offer a limited number of free full-conference “student early” registrations for ICLR 2025, exclusively for full-time students attending in person. This initiative aims to support early-career researchers while fostering diversity, equity, and inclusion (DEI) in the academic community.
Selection Criteria
Applications will be evaluated based on the strength of the submitted materials (see details below). Priority will be given to students presenting papers at our workshop who lack alternative travel support.
How to Apply
Interested students must complete the application form here by 11:59pm (AoE) on March 5, 2025, which includes the following:
- Personal & Academic Details: Name, affiliation, and relevant academic information
- CV/Resume
- Paper ID: Accepted or submitted to our workshop
- Statement of Interest: A brief paragraph explaining how this opportunity will benefit your research and career
- Attendance Confirmation: A clear statement confirming that you will attend in person
Important Notes
- Awardees will be announced on March 10, 2025
- If you have already registered, please submit your receipt, and we will provide further instructions
- Travel and accommodations must be arranged independently—this grant covers registration only
This opportunity is highly competitive, and we encourage all eligible students to apply early!
Registration Grant Recipients
- Gonzalo Gonzalez-Pumariega – Cornell University
- Niklas Muennighoff – Stanford University
- Annya Dahmani – University of California, Berkeley
- Constantin Venhoff – Oxford University
- Harshita Chopra – University of Washington
- Yongchao Chen – Harvard University
- Bo Liu – National University of Singapore
- Xingcheng Yao – University of California, Los Angeles
- Fangru Lin – Oxford University
- Chenchen Ye – University of California, Los Angeles
- Tsz Hang Wong – Imperial College London
- Hui Yuan – Princeton University
Speakers and Panelists
Organizers
This workshop is organized by