mlfoundations / dclmLinks

DataComp for Language Models

☆1,375

Alternatives and similar repositories for dclm

Users that are interested in dclm are comparing it to the libraries listed below

Sorting:

huggingface / nanotron
Minimalistic large language model 3D-parallelism training
☆2,267Updated last month
allenai / dolma
Data and tools for generating and inspecting OLMo pre-training data.
☆1,332Updated 3 weeks ago
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,614Updated last year
huggingface / lighteval
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
☆2,021Updated this week
allenai / OLMoE
OLMoE: Open Mixture-of-Experts Language Models
☆888Updated last month
NVIDIA / NeMo-Aligner
Scalable toolkit for efficient model alignment
☆842Updated 2 weeks ago
microsoft / MInference
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention…
☆1,141Updated 3 weeks ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,111Updated 5 months ago
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,610Updated 11 months ago
allenai / open-instruct
AllenAI's post-training codebase
☆3,252Updated last week
huggingface / datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆2,673Updated last week
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,336Updated 2 months ago
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,619Updated last year
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,751Updated 7 months ago
facebookresearch / MobileLLM
MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
☆1,379Updated 6 months ago
hao-ai-lab / LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
☆1,288Updated 7 months ago
jzhang38 / EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
☆747Updated last year
NVIDIA-NeMo / Curator
Scalable data pre processing and curation toolkit for LLMs
☆1,183Updated this week
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,903Updated this week
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,438Updated last year
trotsky1997 / MathBlackBox
☆1,035Updated 10 months ago
AIDC-AI / Marco-o1
An Open Large Reasoning Model for Real-World Solutions
☆1,522Updated 4 months ago
lmarena / arena-hard-auto
Arena-Hard-Auto: An automatic LLM benchmark.
☆942Updated 4 months ago
multimodal-art-projection / MAP-NEO
☆964Updated 8 months ago
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,856Updated last month
mlc-ai / xgrammar
Fast, Flexible and Portable Structured Generation
☆1,309Updated last week
mit-han-lab / llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
☆3,318Updated 3 months ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆915Updated 5 months ago
vllm-project / llm-compressor
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
☆2,106Updated this week
SimpleBerry / LLaMA-O1
Large Reasoning Models
☆805Updated 10 months ago