allenai / olmo-cookbook
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
β15Updated this week
Alternatives and similar repositories for olmo-cookbook:
Users that are interested in olmo-cookbook are comparing it to the libraries listed below
- β19Updated 3 weeks ago
- π©π€π€ A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)β23Updated last year
- β16Updated last month
- A Data Source for Reasoning Embodied Agentsβ19Updated last year
- Aioli: A unified optimization framework for language model data mixingβ22Updated 2 months ago
- β17Updated this week
- Training hybrid models for dummies.β20Updated 2 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!β14Updated last week
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, anβ¦β13Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0β24Updated 3 weeks ago
- β15Updated 6 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.β28Updated last month
- The official Python library for Formulaicβ16Updated 11 months ago
- β13Updated 3 months ago
- A file utility for accessing both local and remote files through a unified interface.β38Updated 2 weeks ago
- Tools for merging pretrained large language models.β19Updated 9 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zetaβ13Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ24Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the creβ¦β17Updated 5 months ago
- β11Updated last year
- Python client for txtaiβ12Updated 2 weeks ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.β18Updated 8 months ago
- BPE modification that implements removing of the intermediate tokens during tokenizer training.β25Updated 4 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"β21Updated last year
- Learning to route instances for Human vs AI Feedbackβ20Updated last month
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPOβ28Updated 2 weeks ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated 4 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't relβ¦β13Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proceβ¦β14Updated last week
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"β30Updated last month