allenai / OLMo-core
PyTorch building blocks for the OLMo ecosystem
☆269 · Updated this week
Alternatives and similar repositories for OLMo-core
Users who are interested in OLMo-core are comparing it to the libraries listed below.
- Reproducible, flexible LLM evaluations ☆226 · Updated 3 weeks ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc. ☆425 · Updated last week
- A project to improve skills of large language models ☆501 · Updated this week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆344 · Updated 7 months ago
- A simple unified framework for evaluating LLMs ☆235 · Updated 3 months ago
- Automatic evals for LLMs ☆496 · Updated last month
- ☆206 · Updated 5 months ago
- ☆174 · Updated last month
- Tina: Tiny Reasoning Models via LoRA ☆274 · Updated 2 months ago
- Decentralized RL Training at Scale ☆400 · Updated this week
- The HELMET Benchmark ☆162 · Updated 3 months ago
- Evaluation of LLMs on latest math competitions ☆155 · Updated 2 weeks ago
- A simplified implementation for experimenting with RLVR on GSM8K. This repository provides a starting point for exploring reasoning. ☆119 · Updated 5 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆263 · Updated last week
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach. ☆208 · Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆277 · Updated last week
- Benchmarking LLMs with Challenging Tasks from Real Users ☆233 · Updated 9 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆190 · Updated last week
- Scalable toolkit for efficient model reinforcement ☆558 · Updated this week
- Manage scalable open LLM inference endpoints in Slurm clusters ☆268 · Updated last year
- A scalable asynchronous reinforcement learning implementation with in-flight weight updates. ☆134 · Updated this week
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆306 · Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆242 · Updated 8 months ago
- An extension of the nanoGPT repository for training small MOE models. ☆164 · Updated 4 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research ☆217 · Updated this week
- Parallel Scaling Law for Language Model - Beyond Parameter and Inference Time Scaling ☆417 · Updated 2 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆323 · Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models ☆196 · Updated 2 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆207 · Updated 2 months ago
- Physics of Language Models, Part 4 ☆204 · Updated last week