zipnn / zipnn
A Lossless Compression Library for AI pipelines
β245Updated last week
Alternatives and similar repositories for zipnn:
Users that are interested in zipnn are comparing it to the libraries listed below
- π¦ Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data β¦β191Updated last week
- β207Updated this week
- Top papers related to LLM-based agent evaluationβ52Updated this week
- TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect biasβ¦β42Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on Onβ¦β205Updated last month
- PyTorch per step fault tolerance (actively under development)β291Updated this week
- Module, Model, and Tensor Serialization/Deserializationβ225Updated 2 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clustersβ126Updated 5 months ago
- Manage ML configuration with pydanticβ16Updated 5 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β131Updated 4 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.β199Updated 9 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"β154Updated 6 months ago
- Google TPU optimizations for transformers modelsβ109Updated 3 months ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"β140Updated last year
- Inference server benchmarking toolβ56Updated last week
- Accelerating your LLM training to full speed! Made with β€οΈ by ServiceNow Researchβ188Updated this week
- DeMo: Decoupled Momentum Optimizationβ185Updated 5 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024β292Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needsβ284Updated this week
- β37Updated 3 months ago
- TensorHue is a Python library that allows you to visualize tensors right in your console, making understanding and debugging tensor conteβ¦β115Updated 2 months ago
- Scalable and Performant Data Loadingβ252Updated this week
- Tokun to can tokensβ17Updated this week
- Example ML projects that use the Determined library.β32Updated 7 months ago
- β89Updated last week
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Reportβ30Updated 2 months ago
- β103Updated 11 months ago
- β117Updated 8 months ago
- This repository contains the experimental PyTorch native float8 training UXβ224Updated 9 months ago
- A collection of all available inference solutions for the LLMsβ87Updated 2 months ago