NJUDeepEngine / CAEFLinks

Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"

☆12

Alternatives and similar repositories for CAEF

Users that are interested in CAEF are comparing it to the libraries listed below

Sorting:

princeton-nlp / ELIZA-Transformer
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆22Updated 9 months ago
thunlp / APB
Official Implementation of APB (ACL 2025 main Oral)
☆31Updated 8 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36Updated last year
chenllliang / MMEvalPro
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆24Updated last year
DualityRL / multi-attempt
☆19Updated 7 months ago
yale-nlp / refdpo
☆16Updated last year
Infini-AI-Lab / gsm_infinite
☆55Updated 4 months ago
ulab-uiuc / ToMAP
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆20Updated last month
cognitiveailab / GPT-simulator
☆29Updated last year
kyegomez / Reka-Torch
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆28Updated 2 weeks ago
RUCAIBox / JiuZhang3.0
The code and data for the paper JiuZhang3.0
☆49Updated last year
Gen-Verse / CURE
[NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning
☆131Updated last month
VITA-Group / o1-planning
On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
☆41Updated 4 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆40Updated 3 weeks ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
Fu-Dayuan / PreAct
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆30Updated 10 months ago
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
SkyworkAI / MindLink
☆98Updated 3 months ago
ByteDance-Seed / WideSearch
WideSearch: Benchmarking Agentic Broad Info-Seeking
☆98Updated last month
locuslab / scaling_laws_data_filtering
☆65Updated last year
LCM-Lab / LOGO
Code for paper: Long cOntext aliGnment via efficient preference Optimization
☆23Updated 3 weeks ago
jdf-prog / LLM-Engines
☆50Updated 5 months ago
Lagooon / LeanSTaR
☆42Updated last year
shenao-zhang / BARL
Bayes-Adaptive RL for LLM Reasoning
☆40Updated 5 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆42Updated 8 months ago
mathllm / Step-Controlled_DPO
☆23Updated last year
SjJ1017 / CiteLab
☆17Updated 3 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆33Updated last year
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆108Updated 8 months ago
TIGER-AI-Lab / AceCoder
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆93Updated 7 months ago