Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning
☆25Jan 5, 2026Updated last month
Alternatives and similar repositories for Embodied-Planner-R1
Users that are interested in Embodied-Planner-R1 are comparing it to the libraries listed below
- VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …☆21Sep 16, 2025Updated 5 months ago
- This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs☆90Sep 19, 2025Updated 5 months ago
- ☆10Feb 22, 2022Updated 4 years ago
- ☆16Jun 25, 2025Updated 8 months ago
- grpo to train long form QA and instructions with long-form reward model☆17Jul 17, 2025Updated 7 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference☆10Jul 10, 2023Updated 2 years ago
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆22Nov 18, 2025Updated 3 months ago
- ☆28Aug 22, 2025Updated 6 months ago
- ☆21Sep 25, 2025Updated 5 months ago
- Code for COLING 2022 paper "FactMix: Using a Few Labeled In-domain Examples to Generalize to Cross-domain Named Entity Recognition"☆15Jan 15, 2023Updated 3 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022☆13Apr 13, 2022Updated 3 years ago
- ☆19Jul 8, 2025Updated 7 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Nested Named Entity Recognition for Chinese Biomedical Text☆11Jan 25, 2024Updated 2 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆18Nov 18, 2025Updated 3 months ago
- A multi-agent LaTeX translation system that converts English LaTeX documents (e.g., arXiv papers) into PDFs in other languages with a sin…☆20Sep 22, 2025Updated 5 months ago
- This is the official repository of the paper Exploring Superior Function Calls via Reinforcement Learning.☆34Aug 11, 2025Updated 6 months ago
- ☆38Dec 26, 2025Updated 2 months ago
- This repo explores how to use AMR to address tasks that are difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 7 months ago
- FactScoreLite is an implementation of the FactScore metric, designed for detailed accuracy assessment in text generation. This package bu…☆13Apr 25, 2024Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- ☆24Jul 20, 2025Updated 7 months ago
- Latex preprocessor — apply macro definitions, remove comments, and more☆15Aug 8, 2025Updated 6 months ago
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 6 months ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- GitHack is a social gamification platform for GitHub☆15Oct 8, 2014Updated 11 years ago
- Notes made while preparing for the 2020 Tsinghua University computer science graduate admission retest coding problems☆11Apr 17, 2023Updated 2 years ago
- An official implementation for the KDD 2025 paper 'Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Eff…☆22Jun 4, 2025Updated 8 months ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows.☆35Dec 16, 2025Updated 2 months ago
- ☆26Jan 4, 2026Updated last month
- A Pytorch implementation for "Hierarchical Attention Network with Pairwise Loss for Chinese Zero Pronoun Resolution" (AAAI 2020).☆10Dec 10, 2020Updated 5 years ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- ☆11Feb 15, 2023Updated 3 years ago
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…☆17Jan 15, 2024Updated 2 years ago