8-bit CUDA functions for PyTorch
☆72 · Sep 24, 2025 · Updated 7 months ago
Alternatives and similar repositories for bitsandbytes
Users who are interested in bitsandbytes are comparing it to the libraries listed below.
- Fast and memory-efficient exact attention ☆230 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆32 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆113 · Updated this week
- Ahead of Time (AOT) Triton Math Library ☆97 · Apr 17, 2026 · Updated 2 weeks ago
- Development repository for the Triton language and compiler ☆144 · Apr 25, 2026 · Updated last week
- Everything you need to set up on your AMD system for Machine Learning Stuff ☆19 · Jul 31, 2025 · Updated 9 months ago
- ☆24 · Jul 16, 2025 · Updated 9 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆152 · Apr 24, 2026 · Updated last week
- LLM as World Models using Bayesian inference ☆17 · May 27, 2025 · Updated 11 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆140 · Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆26 · Updated this week
- Functional Regularisation for Continual Learning with Gaussian Processes ☆15 · Oct 24, 2020 · Updated 5 years ago
- AMD's graph optimization engine. ☆295 · Updated this week
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆42 · Apr 4, 2025 · Updated last year
- ☆12 · Nov 30, 2018 · Updated 7 years ago
- ☆14 · Sep 24, 2018 · Updated 7 years ago
- Fast and memory-efficient exact attention ported to ROCm ☆13 · Dec 1, 2023 · Updated 2 years ago
- A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server w… ☆23 · Mar 21, 2025 · Updated last year
- ☆11 · Jun 29, 2021 · Updated 4 years ago
- A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch ☆24 · Apr 20, 2026 · Updated last week
- The AMD rocAL is designed to efficiently decode and process images and videos from a variety of storage formats and modify them through a… ☆23 · Apr 23, 2026 · Updated last week
- Adaptive Convolutions with Per-pixel Dynamic Filter Atom ☆27 · Sep 3, 2021 · Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆26 · Apr 24, 2026 · Updated last week
- A framework for creating message-driven training systems with PyTorch ☆21 · Oct 7, 2025 · Updated 6 months ago
- Benchmarking tool for vLLM inference performance with GPU monitoring ☆49 · Apr 16, 2026 · Updated 2 weeks ago
- 지하철도 구구구 ☆10 · Sep 12, 2020 · Updated 5 years ago
- Bytebeat player with a collection of many formulas from around the internet. ☆18 · Dec 2, 2025 · Updated 5 months ago
- Mirror only; see https://gitlab.rtems.org/rtems/docs/rtems-docs/ ☆11 · Apr 23, 2026 · Updated last week
- PyTorch implementation for Decomposed Convolutional Filters Network ☆23 · Feb 19, 2020 · Updated 6 years ago
- ☆27 · Nov 13, 2025 · Updated 5 months ago
- ☆21 · Oct 30, 2024 · Updated last year
- Place & Router for Minetest ☆18 · Nov 5, 2022 · Updated 3 years ago
- This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific… ☆214 · Updated this week
- GoldFinch and other hybrid transformer components ☆13 · Dec 9, 2025 · Updated 4 months ago
- Scripts to recover (accidentally) deleted files from ext3 partitions ☆14 · Aug 16, 2017 · Updated 8 years ago
- This project is an implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-… ☆14 · Dec 12, 2023 · Updated 2 years ago
- ☆142 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆257 · Apr 21, 2026 · Updated last week
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆14 · Jan 8, 2026 · Updated 3 months ago