GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compression of numerical and other data types in HPC/ML applications.
☆391 · Mar 18, 2026 · Updated 2 months ago
Alternatives and similar repositories for dietgpu
Users that are interested in dietgpu are comparing it to the libraries listed below.
- Repository for nvCOMP docs and examples. nvCOMP is a library for fast lossless compression/decompression on the GPU that can be downloade… ☆623 · Sep 11, 2024 · Updated last year
- A collection of tools for neural compression enthusiasts. ☆599 · Sep 20, 2024 · Updated last year
- ☆15 · Aug 3, 2021 · Updated 4 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch ☆18 · Dec 21, 2022 · Updated 3 years ago
- A library for distributed ML training with PyTorch ☆366 · Dec 12, 2022 · Updated 3 years ago
- Some of the fastest decoding range-based Asymmetric Numeral Systems (rANS) codecs for x64 ☆20 · Sep 3, 2024 · Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆49 · Jan 27, 2022 · Updated 4 years ago
- rANS coder (derived from https://github.com/rygorous/ryg_rans) ☆87 · Mar 15, 2022 · Updated 4 years ago
- ☆15 · Jun 10, 2022 · Updated 3 years ago
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,077 · Apr 17, 2024 · Updated 2 years ago
- Massively Parallel ANS Decoding on GPUs ☆30 · Jul 26, 2019 · Updated 6 years ago
- A library for unit scaling in PyTorch ☆133 · Jul 11, 2025 · Updated 10 months ago
- Recoil: Parallel rANS Decoding with Decoder-Adaptive Scalability ☆18 · Jun 26, 2023 · Updated 2 years ago
- Generic image compressor for machine learning. Pytorch code for our paper "Lossy compression for lossless prediction". ☆120 · Aug 19, 2022 · Updated 3 years ago
- Simple repository contribution statistics ☆15 · May 14, 2026 · Updated last week
- New generation entropy codecs : Finite State Entropy and Huff0 ☆1,483 · Mar 21, 2024 · Updated 2 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop ☆40 · Aug 28, 2021 · Updated 4 years ago
- ☆21 · Aug 18, 2022 · Updated 3 years ago
- ☆22 · Aug 31, 2021 · Updated 4 years ago
- Memory-efficient transformer. Work in progress. ☆19 · Sep 17, 2022 · Updated 3 years ago
- Custom compression for CRAM and others. ☆38 · May 7, 2026 · Updated 2 weeks ago
- Python Research Framework ☆107 · Nov 3, 2022 · Updated 3 years ago
- ☆47 · Nov 18, 2022 · Updated 3 years ago
- A case study of efficient training of large language models using commodity hardware. ☆67 · Aug 4, 2022 · Updated 3 years ago
- PyTorch extensions for high performance and large scale training. ☆3,406 · Apr 26, 2025 · Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 · Jul 26, 2022 · Updated 3 years ago
- ☆13 · Mar 25, 2024 · Updated 2 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ☆81 · Mar 17, 2022 · Updated 4 years ago
- Universal Python binding for the LMDB 'Lightning' Database ☆13 · Nov 7, 2017 · Updated 8 years ago
- ☆11 · Oct 3, 2021 · Updated 4 years ago
- TurboRC - Fastest Range Coder + Arithmetic Coding / Fastest Asymmetric Numeral Systems ☆92 · Apr 11, 2026 · Updated last month
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆390 · May 6, 2026 · Updated 2 weeks ago
- Official repo for the ICLR22 paper "Towards Empirical Sandwich Bounds on the Rate-Distortion Function" ☆13 · Feb 1, 2023 · Updated 3 years ago
- Public repository for managing Grid Platform documentation synced with gitbook on docs.grid.ai ☆19 · Mar 27, 2026 · Updated last month
- Entropy coding / arithmetic coding for PyTorch ☆267 · Jul 8, 2022 · Updated 3 years ago
- Directed masked autoencoders ☆15 · Mar 25, 2026 · Updated last month
- A project for Korean automatic spacing ☆12 · Aug 3, 2020 · Updated 5 years ago
- Fairring (FAIR + Herring) is a plug-in for PyTorch that provides a process group for distributed training that outperforms NCCL at large … ☆66 · Mar 21, 2022 · Updated 4 years ago
- GPU-Accelerated Lossless Data Compressors Survey ☆123 · Sep 10, 2020 · Updated 5 years ago