mlfoundations / dclmLinks
DataComp for Language Models
☆1,404Updated 4 months ago
Alternatives and similar repositories for dclm
Users that are interested in dclm are comparing it to the libraries listed below
Sorting:
- Minimalistic large language model 3D-parallelism training☆2,407Updated 3 weeks ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆2,233Updated this week
- Data and tools for generating and inspecting OLMo pre-training data.☆1,393Updated 2 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆945Updated 3 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,650Updated last year
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,812Updated this week
- Recipes to scale inference-time compute of open models☆1,123Updated 7 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,536Updated 7 months ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,654Updated last year
- Scalable data pre processing and curation toolkit for LLMs