vcskaushik / LLMzip
☆50Updated 4 months ago
Alternatives and similar repositories for LLMzip
Users that are interested in LLMzip are comparing it to the libraries listed below
- ☆137Updated 9 months ago
- ☆48Updated 10 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆59Updated 7 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆36Updated last year
- QuIP quantization☆52Updated last year
- An implementation of LLMzip using GPT-2☆12Updated last year
- ☆79Updated 9 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆28Updated 8 months ago
- ☆50Updated 7 months ago
- ☆93Updated 8 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆47Updated last month
- PB-LLM: Partially Binarized Large Language Models☆152Updated last year
- ☆44Updated last year
- This is the code that went into our practical dive into using Mamba for information extraction☆53Updated last year
- ☆68Updated 10 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆37Updated last year
- Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆136Updated 8 months ago
- Here we will test various linear attention designs.☆58Updated last year
- This repository contains code for the MicroAdam paper.☆19Updated 5 months ago
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated last week
- PyTorch implementation of the PEER block from the paper "Mixture of A Million Experts" by Xu Owen He at DeepMind☆126Updated 9 months ago
- ☆61Updated last year
- Work in progress.☆67Updated last week
- Simple repository for training small reasoning models☆31Updated 4 months ago
- ☆197Updated 6 months ago
- ☆125Updated last year
- RWKV-7: Surpassing GPT☆88Updated 6 months ago
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆23Updated 6 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- NAACL '24 (Best Demo Paper Runner-Up) / MLSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆66Updated 5 months ago