thu-coai / CodePlan
☆15 · Updated 7 months ago
Alternatives and similar repositories for CodePlan
Users who are interested in CodePlan are comparing it to the libraries listed below.
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 · Updated 5 months ago
- ☆47 · Updated 5 months ago
- ☆36 · Updated last month
- ☆56 · Updated 7 months ago
- ☆36 · Updated 8 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 accepted paper. ☆32 · Updated last year
- ☆82 · Updated last year
- ☆20 · Updated 7 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆58 · Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 · Updated 2 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆48 · Updated 11 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (COLING 2025) ☆28 · Updated 5 months ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP 2023 Findings. ☆26 · Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales ☆32 · Updated last year
- ☆49 · Updated last year
- The paper list of multilingual pre-trained models (continually updated). ☆22 · Updated 11 months ago
- Code and models for the EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization" ☆40 · Updated 8 months ago
- Automatic prompt optimization framework for multi-step agent tasks. ☆32 · Updated 6 months ago
- The implementation of the paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee… ☆39 · Updated 10 months ago
- ☆42 · Updated 2 months ago
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆37 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆47 · Updated last year
- This repository collects research papers on learning from rewards in the context of post-training and test-time scaling of large language… ☆37 · Updated 3 weeks ago
- Official implementation of “Training on the Benchmark Is Not All You Need”. ☆32 · Updated 5 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆63 · Updated 7 months ago
- ☆16 · Updated 10 months ago
- ☆32 · Updated 2 weeks ago
- Benchmarking Benchmark Leakage in Large Language Models ☆51 · Updated last year
- Reformatted Alignment ☆114 · Updated 8 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆102 · Updated 4 months ago