100M tokens, no time limit, best val loss wins!
☆103Updated this week
Alternatives and similar repositories for slowrun
Users that are interested in slowrun are comparing it to the libraries listed below
- ☆25Feb 20, 2026Updated last week
- Code for "What really matters in matrix-whitening optimizers?"☆21Oct 31, 2025Updated 4 months ago
- Leo optimizer, variation of Muon that runs faster☆57Sep 6, 2025Updated 5 months ago
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- ☆45Jul 21, 2025Updated 7 months ago
- Benchmarking Optimizers for LLM Pretraining☆52Dec 30, 2025Updated 2 months ago
- toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts☆31Sep 1, 2024Updated last year
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆62Updated this week
- ☆36Feb 26, 2024Updated 2 years ago
- ☆56Sep 17, 2025Updated 5 months ago
- ☆35Jul 5, 2023Updated 2 years ago
- Timelight: Universal Path Generator☆22Aug 24, 2025Updated 6 months ago
- ☆34Feb 6, 2026Updated 3 weeks ago
- ☆13Oct 5, 2025Updated 4 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆131Feb 21, 2026Updated last week
- LawGuru, a state-of-the-art web app, utilizes AI chatbot technology, providing personalized legal assistance to simplify the complexities…☆14Apr 16, 2025Updated 10 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆14Apr 29, 2025Updated 10 months ago
- Model architecture for the ThinkOnward Geophysical Foundation Model☆16May 16, 2025Updated 9 months ago
- Metadata Enhanced Collection Orientated Music Player☆18Feb 6, 2026Updated 3 weeks ago
- UFOs and free energy!☆42Dec 31, 2025Updated 2 months ago
- Durability for web streams powered by S2☆22Jan 2, 2026Updated last month
- A low-cost remote vital signs monitor for home patients☆12Feb 19, 2026Updated last week
- Source code related to the paper "Passive Channel Charting: Locating Passive Targets using Wi-Fi Channel State Information"☆28Jul 5, 2025Updated 7 months ago
- AgRec is an open source Agriculture Recommendations from the Cooperative Extension Services.☆12Jan 8, 2022Updated 4 years ago
- ☆18May 3, 2025Updated 9 months ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- A label-marking tool for deep learning☆12Jun 9, 2018Updated 7 years ago
- Utility functions for weights and biases (wandb).☆11Sep 17, 2024Updated last year
- AI assisted article writing project☆13Jan 31, 2025Updated last year
- A GPU accelerated Mandelbrot viewer made using the new WebGPU API.☆10Oct 26, 2023Updated 2 years ago
- ☆24Feb 13, 2026Updated 2 weeks ago
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- PyTorch routines for (Ker)nel (Mac)hines☆10Oct 10, 2025Updated 4 months ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022☆11Sep 17, 2024Updated last year
- Build a Slurm Cluster using SaltStack in virtual machines☆12Nov 26, 2018Updated 7 years ago
- An app that autofills when2meet based on your google calendar☆10May 22, 2023Updated 2 years ago