Train the smallest LM you can that fits in 16MB. Best model wins!
☆5,048 · May 4, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for parameter-golf
Users that are interested in parameter-golf are comparing it to the libraries listed below.
- ☆14 · Dec 26, 2023 · Updated 2 years ago
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models ☆18 · May 23, 2025 · Updated last year
- NanoGPT (124M) in 90 seconds ☆5,270 · May 14, 2026 · Updated 2 weeks ago
- Alpha-Zero Connect Four NN trained via self play ☆27 · Mar 7, 2025 · Updated last year
- Training tiny models to prove hard theorems ☆77 · Mar 5, 2026 · Updated 2 months ago
- Marathon: A Multiple-choice Long Context Evaluation Benchmark for Large Language Models. ☆10 · May 16, 2024 · Updated 2 years ago
- ☆36 · Mar 7, 2025 · Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆78 · Sep 4, 2023 · Updated 2 years ago
- Gradient Informed, GPU Accelerated Lens modelling (GIGALens) -- a package for fast Bayesian inference on strong gravitational lenses. ☆12 · May 20, 2026 · Updated last week
- Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures ☆33 · Jan 29, 2026 · Updated 4 months ago
- ☆14 · Dec 12, 2024 · Updated last year
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,188 · Aug 26, 2025 · Updated 9 months ago
- Our library for RL environments + evals ☆4,125 · May 22, 2026 · Updated last week
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆39 · Dec 2, 2025 · Updated 5 months ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail… ☆12 · Jul 26, 2024 · Updated last year
- ☆53 · Feb 20, 2026 · Updated 3 months ago
- Minimal reproduction of DeepSeek R1-Zero ☆13,104 · Feb 27, 2026 · Updated 3 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model ☆57 · Dec 4, 2024 · Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆30 · Aug 9, 2025 · Updated 9 months ago
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios ☆36 · Dec 1, 2025 · Updated 5 months ago
- VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding ☆58 · May 1, 2026 · Updated 3 weeks ago
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k ☆14 · Oct 11, 2025 · Updated 7 months ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch ☆20 · Mar 2, 2024 · Updated 2 years ago
- ☆28 · Oct 22, 2024 · Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆23 · Jul 4, 2025 · Updated 10 months ago
- ☆188 · Nov 26, 2025 · Updated 6 months ago
- The official implementation of "PixelThink: Towards Efficient Chain-of-Pixel Reasoning" (ICML 2026) ☆42 · May 30, 2025 · Updated last year
- The first AI agent with *agency* ☆43 · Apr 20, 2026 · Updated last month
- ☆19 · Mar 25, 2025 · Updated last year
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset. ☆20 · Feb 12, 2023 · Updated 3 years ago
- Mini Bayesian Optimization package for ACML2020 Tutorial on Bayesian Optimization ☆15 · Sep 30, 2022 · Updated 3 years ago
- Implementation for Decision-focused Summarization (EMNLP2021) ☆12 · Mar 14, 2022 · Updated 4 years ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆50 · Dec 22, 2023 · Updated 2 years ago
- 🚀 Efficient implementations for emerging model architectures ☆5,139 · Updated this week
- s1: Simple test-time scaling ☆6,655 · Jun 25, 2025 · Updated 11 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework ☆21,514 · Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆192 · Jan 19, 2026 · Updated 4 months ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search. ☆87 · May 13, 2026 · Updated 2 weeks ago