thu-coai / CodePlanLinks
☆15Updated 8 months ago
Alternatives and similar repositories for CodePlan
Users that are interested in CodePlan are comparing it to the libraries listed below
Sorting:
- ☆47Updated 2 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 3 weeks ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation☆22Updated 3 weeks ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆82Updated last year
- ☆16Updated 11 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆28Updated 6 months ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆19Updated 3 months ago
- ☆56Updated 8 months ago
- ☆94Updated 6 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆39Updated last year
- ☆38Updated 2 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆34Updated 5 months ago
- Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision☆15Updated 2 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Updated last year
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 4 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆31Updated 7 months ago
- ☆36Updated 9 months ago
- ☆71Updated 9 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 8 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆41Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Updated 4 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆49Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆30Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆58Updated last month
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆38Updated last year
- ☆50Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated last year