dvlab-research / MoTCoderLinks

This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.

☆85

Alternatives and similar repositories for MoTCoder

Users that are interested in MoTCoder are comparing it to the libraries listed below

Sorting:

luo-junyu / RobustFT
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
☆42Updated last year
Ledzy / StreamBP
Official code of "StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs".
☆74Updated 6 months ago
luo-junyu / SemiEvol
SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation
☆58Updated 8 months ago
yongchao98 / CodeSteer-v1.0
Code and dataset of CodeSteer
☆87Updated 9 months ago
OPPO-PersonalAI / TaskCraft
A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.
☆176Updated 5 months ago
syr-cn / AutoRefine
[NeurIPS 2025 Poster] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning
☆116Updated 3 weeks ago
IAAR-Shanghai / Grimoire
Grimoire is All You Need for Enhancing Large Language Models
☆117Updated last year
uw-nsl / TinyV
Your efficient and accurate answer verification system for RL training.
☆43Updated 6 months ago
liuzuyan / ElasticCache
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆42Updated last year
RLHFlow / Self-rewarding-reasoning-LLM
Recipes to train the self-rewarding reasoning LLMs.
☆229Updated 10 months ago
IAAR-Shanghai / ICSFSurvey
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…
☆171Updated last year
WeixiangYAN / CodeTransOcean
[EMNLP 2023] CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation
☆58Updated 2 years ago
SunzeY / SEAgent
Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"
☆216Updated 4 months ago
Yueeeeeeee / HRPO
[NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning
☆168Updated 3 months ago
Ablustrund / MPLSandbox
MPLSandbox is an out-of-the-box multi-programming language sandbox designed to provide unified and comprehensive feedback from compiler a…
☆178Updated 8 months ago
HSLiu-Initial / CtrlA
This includes the original implementation of CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control.
☆64Updated last year
Mercury7353 / PyBench
LLM Benchmark for Code
☆32Updated last year
Qcompiler / vllm-mixed-precision
Support mixed-precsion inference with vllm
☆84Updated 5 months ago
OPPOMKLab / u-LLaVA
u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model
☆134Updated 8 months ago
yiyihum / da-code
[EMNLP 2024] DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models
☆87Updated 5 months ago
DCDmllm / WorldGPT
WorldGPT: Empowering LLM as Multimodal World Model
☆122Updated last year
microsoft / MMLU-CF
A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]
☆122Updated 7 months ago
MrYxJ / enhance_long
This tool(enhance_long) aims to enhance the LlaMa2 long context extrapolation capability in the lowest-cost approach, preferably without …
☆45Updated 2 years ago
huangd1999 / EffiBench
[NeurIPS 2024] EffiBench: Benchmarking the Efficiency of Automatically Generated Code
☆59Updated last year
Baiqi-Li / NaturalBench
🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2…
☆89Updated 6 months ago
zhiyuanhubj / Meta-Ability-Alignment
Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"
☆83Updated 7 months ago
GreenBitAI / green-bit-llm
A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.
☆187Updated 5 months ago
OpenDCAI / RARE
Official implementation of RARE: Retrieval-Augmented Reasoning Modeling
☆185Updated 7 months ago
WeixiangYAN / CodeScope
[ACL 2024] CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and …
☆100Updated last year
tencent-ailab / Leopard
The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"
☆158Updated last year