NJUDeepEngine / CAEFLinks
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Updated 7 months ago
Alternatives and similar repositories for CAEF
Users that are interested in CAEF are comparing it to the libraries listed below
Sorting:
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Updated 8 months ago
- ☆45Updated 3 months ago
- The code and data for the paper JiuZhang3.0☆45Updated last year
- ☆24Updated 11 months ago
- Official Implementation of APB (ACL 2025 main)☆28Updated 3 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆86Updated last month
- ☆49Updated 3 weeks ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆21Updated 3 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆39Updated last month
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆28Updated last month
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆39Updated last month
- Codebase for Instruction Following without Instruction Tuning☆34Updated 8 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free☆40Updated 3 weeks ago
- ☆20Updated 7 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆28Updated 5 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆29Updated 2 weeks ago
- ☆18Updated 2 months ago
- A multimodal agent that can interact with its own PC in a multimodal manner.☆24Updated this week
- ☆40Updated 3 weeks ago
- ☆39Updated this week
- ARM: Adaptive Reasoning Model☆33Updated last week
- ☆47Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 5 months ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆22Updated 6 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated last year
- Unsupervised GRPO☆24Updated this week
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆37Updated 3 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding☆18Updated 7 months ago
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆29Updated last year