allenai / dolma
Data and tools for generating and inspecting OLMo pre-training data.
☆1,196Updated this week
Alternatives and similar repositories for dolma:
Users that are interested in dolma are comparing it to the libraries listed below
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,414Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,463Updated 11 months ago
- AllenAI's post-training codebase☆2,898Updated this week
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆830Updated this week
- [ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding☆1,236Updated last month
- Minimalistic large language model 3D-parallelism training☆1,786Updated this week
- Evaluation suite for LLMs☆345Updated 2 weeks ago