allenai / olmo-cookbookLinks
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆32Updated this week
Alternatives and similar repositories for olmo-cookbook
Users that are interested in olmo-cookbook are comparing it to the libraries listed below
Sorting:
- ☆21Updated 3 weeks ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- Official Repository for Task-Circuit Quantization☆20Updated 3 weeks ago
- ☆41Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆60Updated 2 months ago
- Training hybrid models for dummies.☆23Updated 5 months ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Simple repository for training small reasoning models☆33Updated 4 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 7 months ago
- Testing paligemma2 finetuning on reasoning dataset☆18Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- ☆29Updated 5 months ago
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆23Updated 2 months ago
- ☆21Updated 3 months ago
- ☆47Updated 4 months ago
- ☆16Updated last year
- ☆20Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆16Updated last year
- ☆16Updated 3 months ago
- Reasoning by Communicating with Agents☆29Updated last month
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆18Updated last month
- ☆65Updated 2 months ago
- ☆13Updated 6 months ago
- ☆61Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍☆62Updated last month
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year