A Lossless Compression Library for AI pipelines
☆316Apr 11, 2026Updated 2 months ago
Alternatives and similar repositories for zipnn
Users that are interested in zipnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of "Dataset Size Recovery from LoRA Weights" paper.☆34Jun 30, 2024Updated last year
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- An official implementation of ProbeGen☆13Oct 20, 2024Updated last year
- ☆11Aug 25, 2024Updated last year
- Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which est…☆19Jun 23, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆14Jul 13, 2025Updated 11 months ago
- Top papers related to LLM-based agent evaluation☆91Oct 21, 2025Updated 7 months ago
- Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).☆86Apr 15, 2025Updated last year
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆214May 27, 2026Updated 2 weeks ago
- Code release for "Time Series Anomaly Detection by Cumulative Radon Features"☆12Feb 8, 2022Updated 4 years ago
- [AAAI 2025] Official Implementation for "Click2Mask: Local Editing with Dynamic Mask Generation" Paper.☆21Jan 22, 2026Updated 4 months ago
- Comprehensive LLM Error Analysis and Reporting☆48Updated this week
- [ACL 2026 Oral] Official implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion☆19May 18, 2026Updated 3 weeks ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆229Mar 14, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Zebin Ren and Animesh Trivedi. 2023. Performance Characterization of Modern Storage Stacks: POSIX I/O, libaio, SPDK, and io_uring. In Pro…☆13Mar 30, 2023Updated 3 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆96Sep 4, 2024Updated last year
- Pytorch implementation of "Oscillation-Reduced MXFP4 Training for Vision Transformers" on DeiT Model Pre-training☆40May 4, 2026Updated last month
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆3,312Jun 6, 2026Updated last week
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated last year
- ☆16May 14, 2025Updated last year
- some mixture of experts architecture implementations☆27Mar 22, 2024Updated 2 years ago
- Official PyTorch implementation for ״ lassification-Regression for Chart Comprehension״☆26Feb 5, 2025Updated last year
- ☆50Jan 18, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- ☆53Oct 29, 2024Updated last year
- ☆27Apr 23, 2026Updated last month
- The official repo of the paper "StressTest: Can YOUR Speech LM Handle the Stress?"☆20Jul 9, 2025Updated 11 months ago
- The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)☆50Aug 15, 2025Updated 9 months ago
- Estimate resources needed to train LLMs☆14Feb 10, 2026Updated 4 months ago
- ☆91Oct 17, 2025Updated 7 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆63May 16, 2025Updated last year
- FZ-GPU: A Fast and High-Ratio Lossy Compressor for Scientific Data on GPUs☆14Sep 26, 2023Updated 2 years ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Nov 23, 2022Updated 3 years ago
- [MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving☆341Jul 2, 2024Updated last year
- ☆47Feb 26, 2026Updated 3 months ago
- The driver for LMCache core to run in vLLM☆67Feb 4, 2025Updated last year
- ☆14Dec 1, 2025Updated 6 months ago