Train the smallest LM you can that fits in 16MB. Best model wins!
☆4,701Mar 30, 2026Updated 2 weeks ago
Alternatives and similar repositories for parameter-golf
Users that are interested in parameter-golf are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 10 months ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- Training tiny models to prove hard theorems☆72Mar 5, 2026Updated last month
- NanoGPT (124M) in 2 minutes☆5,070Mar 29, 2026Updated 2 weeks ago
- Alpha-Zero Connect Four NN trained via self play☆27Mar 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models.☆10May 16, 2024Updated last year
- ☆36Mar 7, 2025Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643☆78Sep 4, 2023Updated 2 years ago
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures☆33Jan 29, 2026Updated 2 months ago
- ☆14Dec 12, 2024Updated last year
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 4 months ago
- Democratizing Reinforcement Learning for LLMs☆5,402Updated this week
- Our library for RL environments + evals☆3,986Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,146Aug 26, 2025Updated 7 months ago
- ☆48Feb 20, 2026Updated last month
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆57Dec 4, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 8 months ago
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios☆30Dec 1, 2025Updated 4 months ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 6 months ago
- ☆28Oct 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for testing DCT plus Sparse (DCTpS) networks☆14Jun 15, 2021Updated 4 years ago
- ☆185Nov 26, 2025Updated 4 months ago
- ☆19Mar 25, 2025Updated last year
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- Mini Bayesian Optimization package for ACML2020 Tutorial on Bayesian Optimization☆15Sep 30, 2022Updated 3 years ago
- Implementation for Decision-focused Summarization (EMNLP2021)☆12Mar 14, 2022Updated 4 years ago
- Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.☆26Jun 6, 2025Updated 10 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆49Dec 22, 2023Updated 2 years ago
- Benchmark of LLMs on real open-source projects against dependency hell, legacy toolchains, and complex build systems.☆54Dec 23, 2025Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆78Updated this week
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆50Feb 5, 2026Updated 2 months ago
- ☆29Apr 7, 2026Updated last week
- Minimal reproduction of DeepSeek R1-Zero☆13,038Feb 27, 2026Updated last month
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆190Jan 19, 2026Updated 2 months ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆39Oct 9, 2025Updated 6 months ago
- Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours☆262Apr 7, 2026Updated last week