akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆71Updated last year
Alternatives and similar repositories for Arithmo:
Users that are interested in Arithmo are comparing it to the libraries listed below
- ☆117Updated 4 months ago
- This is the official repository for Inheritune.☆109Updated last week
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆115Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 4 months ago
- Evaluating LLMs with CommonGen-Lite☆88Updated 11 months ago
- ☆87Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- ☆48Updated 3 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- Pre-training code for CrystalCoder 7B LLM☆55Updated 9 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆52Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆101Updated 6 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆113Updated 5 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users☆215Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- ☆58Updated 9 months ago
- Open Implementations of LLM Analyses☆98Updated 4 months ago
- ☆40Updated 9 months ago
- ☆94Updated last year
- ☆108Updated 3 weeks ago
- ☆116Updated 3 months ago
- Code repository for the c-BTM paper☆105Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆107Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆137Updated 2 weeks ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 4 months ago
- ☆27Updated this week