facebookresearch / moodist
moodist
☆24 Updated this week
Alternatives and similar repositories for moodist
Users who are interested in moodist are comparing it to the libraries listed below.
- ☆55 Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆136 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Can Language Models Solve Olympiad Programming? ☆124 Updated 11 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆181 Updated 6 months ago
- Fluid Language Model Benchmarking ☆25 Updated 3 months ago
- ☆33 Updated last year
- ☆150 Updated 4 months ago
- ☆213 Updated last month
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations" ☆87 Updated last year
- Applying SAEs for fine-grained control ☆25 Updated last year
- ☆53 Updated last year
- ☆127 Updated 2 months ago
- A set of Python scripts that makes your experience on TPU better ☆54 Updated 3 months ago
- ☆91 Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆55 Updated 6 months ago
- Minimum Description Length probing for neural network representations ☆20 Updated 11 months ago
- ☆178 Updated 3 weeks ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆198 Updated last year
- ☆23 Updated 11 months ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆19 Updated 11 months ago
- seqax = sequence modeling + JAX ☆169 Updated 5 months ago
- ☆27 Updated 3 months ago
- Minimal but scalable implementation of large language models in JAX ☆35 Updated last month
- ☆80 Updated 3 months ago
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆18 Updated 3 months ago
- some common Huggingface transformers in maximal update parametrization (µP) ☆87 Updated 3 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆132 Updated 3 years ago
- Evaluation of LLMs on latest math competitions ☆211 Updated 2 weeks ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025) ☆30 Updated 3 months ago