zipnn / zipnnLinks
A Lossless Compression Library for AI pipelines
☆268Updated 2 weeks ago
Alternatives and similar repositories for zipnn
Users that are interested in zipnn are comparing it to the libraries listed below
Sorting:
- ☆230Updated this week
- Simple high-throughput inference library☆120Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated last year
- ☆134Updated 11 months ago
- Google TPU optimizations for transformers models☆116Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆265Updated 9 months ago
- Inference server benchmarking tool☆83Updated 2 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆139Updated this week
- Scalable and Performant Data Loading☆288Updated this week
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆153Updated 9 months ago
- DeMo: Decoupled Momentum Optimization☆189Updated 7 months ago
- ☆369Updated this week
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆244Updated 5 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆128Updated 7 months ago
- ☆74Updated 3 weeks ago
- ☆96Updated last month
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆214Updated this week
- Load compute kernels from the Hub☆207Updated this week
- A safetensors extension to efficiently store sparse quantized tensors on disk☆137Updated this week
- ☆188Updated 3 weeks ago
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆206Updated this week
- A tool to configure, launch and manage your machine learning experiments.☆172Updated this week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)☆361Updated last week
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report☆30Updated 4 months ago
- PyTorch implementation of models from the Zamba2 series.☆184Updated 5 months ago
- A collection of all available inference solutions for the LLMs☆91Updated 4 months ago
- ☆128Updated 3 months ago
- ☆45Updated last year
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆405Updated this week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated 2 months ago