thu-coai / CodePlan
☆16Updated 6 months ago
Alternatives and similar repositories for CodePlan
Users that are interested in CodePlan are comparing it to the libraries listed below
Sorting:
- ☆47Updated 4 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆31Updated 4 months ago
- ☆81Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 4 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆44Updated this week
- Automatic prompt optimization framework for multi-step agent tasks.☆30Updated 6 months ago
- ☆36Updated last month
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- ☆94Updated 5 months ago
- ☆49Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 7 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 10 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆21Updated 10 months ago
- ☆24Updated 4 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models☆43Updated 5 months ago
- ☆35Updated last year
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆26Updated 5 months ago
- FuseAI Project☆86Updated 3 months ago
- ☆20Updated 6 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆49Updated last year
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated 8 months ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆75Updated 10 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆84Updated 7 months ago
- Revisiting Mid-training in the Era of RL Scaling☆37Updated 3 weeks ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆68Updated 2 months ago