BorealisAI / neuzip
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This repository contains the code for the experiments in the paper.
☆27Updated last week
Related projects
Alternatives and complementary repositories for neuzip
- Official implementation of ECCV24 paper: POA☆24Updated 3 months ago
- Lottery Ticket Adaptation☆36Updated last month
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆45Updated 4 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆22Updated last week
- QuIP quantization☆46Updated 7 months ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆15Updated this week
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆91Updated last month
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆37Updated 6 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated 8 months ago
- A repository for research on medium sized language models.☆74Updated 5 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆30Updated 2 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆30Updated last month
- Implementation of the Mamba SSM with hf_integration.☆55Updated 2 months ago
- A reproduction of the paper "Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction"☆21Updated 5 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆34Updated 3 weeks ago
- DPO, but faster 🚀☆21Updated 2 weeks ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆14Updated 3 months ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging☆52Updated last month
- This repository contains code for the MicroAdam paper.☆12Updated 4 months ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆18Updated last month
- imagetokenizer is a Python package that helps you encode visuals and generate visual token IDs from a codebook; supports both image and video…☆27Updated 4 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆37Updated 2 months ago
- 32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.☆42Updated last year
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding"☆37Updated 3 weeks ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago