akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated 11 months ago
Alternatives and similar repositories for Arithmo:
Users who are interested in Arithmo are comparing it to the repositories listed below:
- Evaluating LLMs with CommonGen-Lite☆87Updated 9 months ago
- This is the official repository for Inheritune.☆108Updated 3 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆50Updated 3 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆153Updated last month
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆115Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆83Updated 3 weeks ago
- Code repository for the c-BTM paper☆105Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆126Updated 2 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆129Updated 2 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆90Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆80Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆66Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆116Updated 5 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆113Updated 4 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning (https://arxiv.org/pdf/2410.01044) ☆30Updated 3 months ago
- The official repo for "LLoCo: Learning Long Contexts Offline"☆114Updated 7 months ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv:2401.01335)☆29Updated 10 months ago