zipnn / zipnn
A Lossless Compression Library for AI pipelines
☆226 · Updated last week
Alternatives and similar repositories for zipnn:
Users who are interested in zipnn are comparing it to the libraries listed below.
- 🦄 Unitxt: a python library for getting data fired up and set for training and evaluation ☆177 · Updated this week
- An open source interactive spectrogram audio player, primarily based on bokeh and the holoviz stack (wav+holoviz=waloviz) ☆66 · Updated 7 months ago
- ☆167 · Updated this week
- ☆22 · Updated this week
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton. ☆122 · Updated last week
- PyTorch per step fault tolerance (actively under development) ☆260 · Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆123 · Updated 2 months ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆154 · Updated 4 months ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP. ☆36 · Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆123 · Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆260 · Updated 4 months ago
- TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect bias… ☆38 · Updated last week
- Google TPU optimizations for transformers models ☆102 · Updated last month
- Scalable and Performant Data Loading ☆222 · Updated this week
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆45 · Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆133 · Updated 2 weeks ago
- Code for studying the super weight in LLM ☆90 · Updated 3 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆208 · Updated this week
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆191 · Updated this week
- A safetensors extension to efficiently store sparse quantized tensors on disk ☆80 · Updated this week
- Aana SDK is a powerful framework for building AI enabled multimodal applications. ☆42 · Updated last week
- A pytest plugin for running and analyzing LLM evaluation tests. ☆104 · Updated last month
- Self-hosted LLM chatbot arena, with yourself as the only judge ☆37 · Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA) ☆13 · Updated 6 months ago
- ☆33 · Updated last year
- DeMo: Decoupled Momentum Optimization ☆181 · Updated 3 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models" ☆221 · Updated last month
- ☆110 · Updated 2 months ago