zipnn / zipnn
A Lossless Compression Library for AI pipelines
☆240Updated this week
Alternatives and similar repositories for zipnn:
Users that are interested in zipnn are comparing it to the libraries listed below
- ☆190Updated last week
- An open source interactive spectrogram audio player, primarily based on bokeh and the holoviz stack (wav+holoviz=waloviz)☆66Updated 8 months ago
- PyTorch per step fault tolerance (actively under development)☆273Updated this week
- Module, Model, and Tensor Serialization/Deserialization☆221Updated last month
- 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation☆183Updated this week
- Scalable and Performant Data Loading☆234Updated this week
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- Pragmatic approach to parsing import profiles for CI's☆11Updated 9 months ago
- DeMo: Decoupled Momentum Optimization☆185Updated 4 months ago
- Google TPU optimizations for transformers models☆107Updated 2 months ago
- NLP with Rust for Python 🦀🐍☆61Updated 10 months ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆39Updated this week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated last month
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆128Updated 3 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆248Updated this week
- Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead…☆629Updated this week
- ☆21Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆262Updated 6 months ago
- Inference server benchmarking tool☆48Updated last week
- Focused on fast experimentation and simplicity☆71Updated 3 months ago
- An introduction to LLM Sampling☆77Updated 3 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆32Updated last month
- Load compute kernels from the Hub☆113Updated this week
- An implementation of PSGD Kron second-order optimizer for PyTorch☆86Updated last week
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conte…☆114Updated last month
- ☆82Updated this week
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆156Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆125Updated 4 months ago
- Efficient optimizers☆186Updated this week
- ☆76Updated 9 months ago