PrimeIntellect-ai / INTELLECT-MATH
A 7B parameter model for mathematical reasoning
☆34 Updated 3 months ago
Alternatives and similar repositories for INTELLECT-MATH
Users who are interested in INTELLECT-MATH are comparing it to the repositories listed below.
- ☆125 Updated last month
- Compiling useful links, papers, benchmarks, ideas, etc. ☆46 Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆98 Updated 2 months ago
- Simple repository for training small reasoning models ☆27 Updated 3 months ago
- ☆56 Updated last week
- Repository for the paper Stream of Search: Learning to Search in Language ☆146 Updated 3 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆51 Updated 3 weeks ago
- Train your own SOTA deductive reasoning model ☆92 Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 4 months ago
- Replicating O1 inference-time scaling laws ☆85 Updated 5 months ago
- ☆74 Updated 3 weeks ago
- ☆47 Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated last month
- Modded vLLM to run pipeline parallelism over public networks ☆33 Updated this week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 3 months ago
- train with kittens! ☆57 Updated 6 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆97 Updated 3 weeks ago
- Collection of LLM completions for reasoning-gym task datasets ☆20 Updated this week
- Open source interpretability artefacts for R1. ☆109 Updated 3 weeks ago
- Experiments for efforts to train a new and improved t5 ☆77 Updated last year
- ☆60 Updated last year
- look how they massacred my boy ☆63 Updated 7 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 2 months ago
- Entropy Based Sampling and Parallel CoT Decoding ☆17 Updated 7 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆72 Updated 9 months ago
- Simple GRPO scripts and configurations. ☆58 Updated 3 months ago
- ☆129 Updated last month
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆60 Updated 6 months ago
- ☆43 Updated last year
- Lego for GRPO ☆28 Updated last month