allenai / olmo-cookbook
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
☆26 · Updated this week
Alternatives and similar repositories for olmo-cookbook
Users who are interested in olmo-cookbook are comparing it to the libraries listed below.
- Training hybrid models for dummies. ☆21 · Updated 4 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Updated 7 months ago
- ☆18 · Updated 7 months ago
- ☆21 · Updated 2 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 2 months ago
- ☆48 · Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 · Updated 5 months ago
- Very minimal (and stateless) agent framework ☆43 · Updated 4 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 · Updated last year
- ☆41 · Updated 5 months ago
- ☆9 · Updated 2 weeks ago
- ☆16 · Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 · Updated 8 months ago
- A forest of autonomous agents. ☆19 · Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated 6 months ago
- ☆31 · Updated 3 weeks ago
- ☆20 · Updated last week
- ☆64 · Updated last month
- Verifiers for LLM Reinforcement Learning ☆50 · Updated last month
- ☆25 · Updated 3 months ago
- ☆43 · Updated 3 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama ☆10 · Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o… ☆19 · Updated 7 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆22 · Updated last month
- ☆29 · Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixing ☆25 · Updated 3 months ago
- Efficiently computing & storing token n-grams from large corpora ☆23 · Updated 7 months ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · Updated last year
- Official Repository for Task-Circuit Quantization ☆20 · Updated 2 weeks ago
- GoldFinch and other hybrid transformer components ☆10 · Updated this week