AviSoori1x / makeMoELinks
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
☆786Updated last year
Alternatives and similar repositories for makeMoE
Users that are interested in makeMoE are comparing it to the libraries listed below
Sorting:
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,650Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,407Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning☆1,059Updated 5 months ago
- Reference implementation of Megalodon 7B model☆527Updated 7 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆751Updated last year
- ☆973Updated 11 months ago
- Large Reasoning Models☆806Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,181Updated 4 months ago
- Recipes to scale inference-time compute of open models☆1,123Updated 7 months ago
- DataComp for Language Models☆1,404Updated 4 months ago
- Llama from scratch, or How to implement a paper without crying☆581Updated last year
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆898Updated 3 months ago
- FuseAI Project☆585Updated 11 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,230Updated last year
- ☆969Updated 11 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆937Updated 10 months ago
- 🍃 MINT-1T: A one trillion token multimodal interleaved dataset.☆827Updated last year
- Code for Quiet-STaR☆742Updated last year
- ☆1,035Updated last year
- Comprehensive toolkit for Reinforcement Learning from Human Feedback (RLHF) training, featuring instruction fine-tuning, reward model tra…☆181Updated last year
- OLMoE: Open Mixture-of-Experts Language Models☆945Updated 3 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆411Updated last year
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆737Updated 7 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆814Updated 9 months ago
- Minimal hackable GRPO implementation☆309Updated 11 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆664Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆482Updated last year
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤☆1,087Updated 11 months ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆575Updated 3 months ago
- Codebase for Merging Language Models (ICML 2024)☆863Updated last year