NJUDeepEngine / CAEF
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11 Updated last month
Related projects
Alternatives and complementary repositories for CAEF
- Representing Rule-based Chatbots with Transformers ☆18 Updated 4 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆22 Updated last month
- ☆30 Updated this week
- ☆19 Updated 5 months ago
- Code for https://arxiv.org/abs/2401.17139 (NeurIPS 2024) ☆25 Updated last week
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆44 Updated 10 months ago
- Official repository for Decentralized Arena via Collective LLM Intelligence ☆8 Updated last month
- The code and data for the paper JiuZhang3.0 ☆35 Updated 5 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆46 Updated 2 weeks ago
- ☆15 Updated 3 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ☆43 Updated 4 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆36 Updated 8 months ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang ☆13 Updated 10 months ago
- ☆18 Updated last week
- Codebase for Instruction Following without Instruction Tuning ☆32 Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆28 Updated 5 months ago
- ☆21 Updated 5 months ago
- ☆22 Updated 2 months ago
- ☆25 Updated 2 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆38 Updated 4 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking ☆33 Updated this week
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆69 Updated last month
- ☆50 Updated last month
- [ACL 2024] The project of Symbol-LLM ☆42 Updated 4 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups ☆19 Updated 2 weeks ago
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆86 Updated last month
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆36 Updated 3 months ago
- ☆17 Updated 4 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters" ☆16 Updated 2 weeks ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆18 Updated last week