allenai / OLMo-coreLinks

PyTorch building blocks for the OLMo ecosystem

☆307

Alternatives and similar repositories for OLMo-core

Users that are interested in OLMo-core are comparing it to the libraries listed below

Sorting:

allenai / olmes
Reproducible, flexible LLM evaluations
☆257Updated last week
NVIDIA-NeMo / Skills
A project to improve skills of large language models
☆587Updated last week
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆342Updated 10 months ago
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆539Updated this week
huggingface / fineweb-2
☆200Updated 4 months ago
shangshang-wang / Tina
Tina: Tiny Reasoning Models via LoRA
☆299Updated last month
ServiceNow / PipelineRL
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
☆260Updated last week
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆202Updated 7 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆250Updated 11 months ago
huggingface / picotron_tutorial
☆224Updated last week
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆273Updated last year
changjonathanc / flex-nano-vllm
FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.
☆300Updated 2 months ago
WildEval / ZeroEval
A simple unified framework for evaluating LLMs
☆254Updated 6 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆297Updated 2 months ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆233Updated 3 months ago
mlfoundations / evalchemy
Automatic evals for LLMs
☆550Updated 4 months ago
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆215Updated last month
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆242Updated 11 months ago
facebookresearch / RAM
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
☆295Updated this week
Mohammadjafari80 / GSM8K-RLVR
A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning.
☆136Updated 8 months ago
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆311Updated last year
snowflakedb / ArcticTraining
ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)
☆231Updated last week
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆216Updated this week
huggingface / kernels
Load compute kernels from the Hub
☆304Updated last week
princeton-nlp / HELMET
The HELMET Benchmark
☆178Updated 2 months ago
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆198Updated 5 months ago
foundation-model-stack / fms-fsdp
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash…
☆270Updated 3 months ago
QwenLM / ParScale
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
☆448Updated 5 months ago
HazyResearch / lolcats
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
☆248Updated 8 months ago
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆254Updated this week