Small and Efficient Mathematical Reasoning LLMs
☆73Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for Arithmo
Users that are interested in Arithmo are comparing it to the libraries listed below
Sorting:
- Code for NeurIPS LLM Efficiency Challenge☆60Apr 9, 2024Updated last year
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆21Feb 23, 2026Updated last week
- ☆10Oct 24, 2024Updated last year
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated last year
- ☆20Jun 16, 2025Updated 8 months ago
- Extract structured data from PDF invoices☆14Mar 16, 2021Updated 4 years ago
- ☆17Apr 7, 2025Updated 10 months ago
- StAtutory Reasoning Assessment☆16Dec 8, 2022Updated 3 years ago
- ☆40Mar 25, 2023Updated 2 years ago
- codebase release for EMNLP2023 paper publication☆19Sep 18, 2025Updated 5 months ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Nov 19, 2024Updated last year
- Scaling Data-Constrained Language Models☆342Jun 28, 2025Updated 8 months ago
- ☆16Jun 20, 2023Updated 2 years ago
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆77Aug 30, 2023Updated 2 years ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models☆454Feb 1, 2024Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- ☆38Feb 1, 2026Updated last month
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆24Sep 26, 2024Updated last year
- ☆24Feb 6, 2023Updated 3 years ago
- Inference code for Persimmon-8B☆412Sep 9, 2023Updated 2 years ago
- A multi-purpose LLM framework for RAG and data creation.☆629Jan 13, 2024Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- ☆63Sep 23, 2024Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Apr 4, 2023Updated 2 years ago
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Jan 9, 2024Updated 2 years ago
- GPT-4 Passes the Bar☆28Dec 19, 2023Updated 2 years ago
- Claude-router is a best project for using open model in claude-code☆55Sep 4, 2025Updated 6 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆74Nov 4, 2025Updated 4 months ago
- ☆27Oct 18, 2021Updated 4 years ago
- ☆32Jul 5, 2024Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆255Nov 10, 2024Updated last year
- Robust recipes to align language models with human and AI preferences☆5,510Sep 8, 2025Updated 5 months ago
- ☆62Dec 8, 2023Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆270Sep 12, 2024Updated last year