Small and Efficient Mathematical Reasoning LLMs
☆73 Jan 27, 2024 Updated 2 years ago
Alternatives and similar repositories for Arithmo
Users who are interested in Arithmo are comparing it to the libraries listed below.
- Code for NeurIPS LLM Efficiency Challenge ☆60 Apr 9, 2024 Updated 2 years ago
- Evals meant to evaluate language models' ability to reason over long contexts. ☆10 Sep 12, 2024 Updated last year
- Implementation of Concept-level Debugging of Part-Prototype Networks ☆12 May 9, 2023 Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops! ☆50 Mar 19, 2024 Updated 2 years ago
- ☆10 Oct 24, 2024 Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset ☆12 Mar 29, 2023 Updated 3 years ago
- codebase release for EMNLP2023 paper publication ☆19 Sep 18, 2025 Updated 6 months ago
- Inference code for Persimmon-8B ☆412 Sep 9, 2023 Updated 2 years ago
- The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi… ☆18 Aug 7, 2024 Updated last year
- ☆41 Nov 30, 2023 Updated 2 years ago
- ☆10 Mar 5, 2024 Updated 2 years ago
- MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models ☆454 Feb 1, 2024 Updated 2 years ago
- ☆32 Jul 5, 2024 Updated last year
- Scaling Data-Constrained Language Models ☆343 Jun 28, 2025 Updated 9 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆117 Jun 28, 2025 Updated 9 months ago
- StAtutory Reasoning Assessment ☆17 Dec 8, 2022 Updated 3 years ago
- ☆17 Apr 7, 2025 Updated last year
- Sample solution for MLOps Marathon 2023 ☆29 Jun 25, 2023 Updated 2 years ago
- GPT* - Training faster small transformers using ALiBi, Parallel Residual Connections and more! ☆21 Oct 29, 2022 Updated 3 years ago
- ☆56 Jun 26, 2025 Updated 9 months ago
- ☆22 Aug 8, 2025 Updated 8 months ago
- A collection of hand on notebook for LLMs practitioner ☆52 Jan 13, 2025 Updated last year
- Training Proactive and Personalized LLM Agents ☆106 Jan 20, 2026 Updated 2 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold" ☆32 Jun 16, 2024 Updated last year
- Generate textbook-quality synthetic LLM pretraining data ☆509 Oct 19, 2023 Updated 2 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆21 Jul 9, 2023 Updated 2 years ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆100 Apr 9, 2025 Updated last year
- A repository to perform self-instruct with a model on HF Hub ☆32 Sep 29, 2023 Updated 2 years ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆270 Sep 12, 2024 Updated last year
- ☆40 Mar 25, 2023 Updated 3 years ago
- ☆63 Sep 23, 2024 Updated last year
- ☆19 Feb 20, 2023 Updated 3 years ago
- ☆13 Jan 20, 2023 Updated 3 years ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code. ☆18 May 7, 2024 Updated last year
- Code repo for CLERC: A Legal Precedent Dataset for Case Retrieval and Retrieval-Augmented Analysis Generation (NAACL 2025) ☆28 Jan 28, 2025 Updated last year
- A Data Science pipeline for Algorithmic Trading: A comparative study in applications to Finance and cryptoeconomics ☆14 Jul 1, 2022 Updated 3 years ago
- Translation between Traditional Chinese and Simplified Chinese. 繁简转换。 ☆14 May 12, 2015 Updated 10 years ago
- official implementation of paper "Process Reward Model with Q-value Rankings" ☆68 Feb 5, 2025 Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆210 Jan 13, 2024 Updated 2 years ago