thepowerfuldeez / OLMo

My fork of Allen AI's OLMo for educational purposes.

☆30

Alternatives and similar repositories for OLMo

Users who are interested in OLMo are comparing it to the repositories listed below.

samchaineau / llm_slerp_generation
Repo hosting code and materials related to speeding up LLM inference using token merging.
☆36 Updated last year
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆144 Updated 9 months ago
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆61 Updated 2 months ago
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆77 Updated last year
RobertCsordas / moe_attention
Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"
☆98 Updated 8 months ago
SeunghyunSEO / optimized_hf_llama_class_for_training
☆47 Updated 10 months ago
arcee-ai / DAM
☆51 Updated 7 months ago
jeffreysijuntan / lloco
The official repo for "LLoCo: Learning Long Contexts Offline"
☆117 Updated last year
IST-DASLab / QuEST
Work in progress.
☆69 Updated 3 weeks ago
HanGuo97 / lq-lora
☆126 Updated last year
wdlctc / mini-s
☆51 Updated 7 months ago
Zyphra / Zyda_processing
☆35 Updated last year
NathanGodey / qfilters
Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)
☆33 Updated 3 months ago
astramind-ai / Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆163 Updated last year
sanyalsunny111 / LLM-Inheritune
This is the official repository for Inheritune.
☆111 Updated 4 months ago
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆55 Updated this week
RobertCsordas / moeut
☆79 Updated 10 months ago
joey00072 / ohara
Collection of autoregressive model implementations
☆85 Updated 2 months ago
IST-DASLab / RoSA
Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)
☆42 Updated last year
SalesforceAIResearch / GemFilter
☆80 Updated 5 months ago
PiotrNawrot / sparse-frontier
The evaluation framework for training-free sparse attention in LLMs
☆79 Updated last week
Tomorrowdawn / top_nsigma
The official code repo and data hub of the top_nsigma sampling strategy for LLMs.
☆26 Updated 4 months ago
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆92 Updated 7 months ago
nanowell / Q-Sparse-LLM
My implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated
☆32 Updated 10 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆102 Updated 2 months ago
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆59 Updated 8 months ago
FasterDecoding / BitDelta
☆198 Updated 6 months ago
Pleias / Various-Finetuning
Set of scripts to fine-tune LLMs
☆37 Updated last year
vicksEmmanuel / latent-gemma
☆26 Updated 5 months ago
SalesforceAIResearch / LaTRO
☆115 Updated 4 months ago