thepowerfuldeez / OLMoLinks
My fork os allen AI's OLMo for educational purposes.
☆30Updated 6 months ago
Alternatives and similar repositories for OLMo
Users that are interested in OLMo are comparing it to the libraries listed below
Sorting:
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated last month
- A repository for research on medium sized language models.☆76Updated last year
- ☆47Updated 9 months ago
- ☆79Updated 9 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆97Updated 8 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 8 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 8 months ago
- Collection of autoregressive model implementation☆85Updated last month
- ☆51Updated 7 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆85Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆32Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆56Updated last month
- The official repo for "LLoCo: Learning Long Contexts Offline"☆116Updated 11 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- ☆125Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 10 months ago
- ☆49Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆72Updated 2 weeks ago
- ☆25Updated 4 months ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆42Updated last year
- QuIP quantization☆52Updated last year
- ☆79Updated 4 months ago
- Work in progress.☆68Updated last week
- ☆45Updated 3 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- PB-LLM: Partially Binarized Large Language Models☆152Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆124Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆62Updated 4 months ago