Faster Pytorch bitsandbytes 4bit fp4 nn.Linear ops
☆30 · Mar 16, 2024 · Updated 2 years ago
Alternatives and similar repositories for torch-bnb-fp4
Users that are interested in torch-bnb-fp4 are comparing it to the libraries listed below.
- IntLLaMA: A fast and light quantization solution for LLaMA ☆18 · Jul 21, 2023 · Updated 2 years ago
- Waifu2x keras implementation https://github.com/nagadomi/waifu2x ☆10 · Jun 19, 2016 · Updated 9 years ago
- ☆17 · Dec 12, 2021 · Updated 4 years ago
- Tensorflow 2.x implementation of Gradient Origin Networks ☆12 · Jul 13, 2020 · Updated 5 years ago
- Unofficial Scalable-Softmax Is Superior for Attention ☆20 · May 30, 2025 · Updated last year
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com) ☆45 · Aug 19, 2021 · Updated 4 years ago
- High Performance Int8 GEMM Kernels for SM80 and later GPUs. ☆23 · Mar 11, 2025 · Updated last year
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu ☆78 · Dec 3, 2024 · Updated last year
- ☆12 · Sep 1, 2023 · Updated 2 years ago
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020) ☆18 · Jun 22, 2022 · Updated 3 years ago
- ACL 2023 ☆39 · Jun 6, 2023 · Updated 3 years ago
- ☆12 · Jan 4, 2024 · Updated 2 years ago
- Parallel Associative Scan for Language Models ☆18 · Jan 8, 2024 · Updated 2 years ago
- Knowledge Graph Generator app ☆35 · Apr 18, 2024 · Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jul 22, 2023 · Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆12 · Aug 15, 2024 · Updated last year
- ☆19 · Nov 6, 2023 · Updated 2 years ago
- ☆13 · Jun 3, 2024 · Updated 2 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw… ☆18 · Feb 27, 2020 · Updated 6 years ago
- Build modern UIs in Jupyter with Python ☆12 · Dec 28, 2022 · Updated 3 years ago
- ☆26 · Updated this week
- An OpenAI API compatible images server to generate or manipulate images. ☆18 · Feb 2, 2025 · Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut… ☆23 · Dec 4, 2024 · Updated last year
- ICLR 2021 ☆48 · Mar 18, 2021 · Updated 5 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆83 · Sep 10, 2023 · Updated 2 years ago
- ☆235 · Jun 11, 2024 · Updated 2 years ago
- This repository contains the experimental PyTorch native float8 training UX ☆226 · Aug 1, 2024 · Updated last year
- torch_quantizer is an out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models. ☆25 · Mar 29, 2024 · Updated 2 years ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation ☆161 · Mar 21, 2025 · Updated last year
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆55 · Jan 28, 2024 · Updated 2 years ago
- a simple variational auto encoder with some exploration ☆12 · Nov 22, 2024 · Updated last year
- ☆22 · Feb 11, 2022 · Updated 4 years ago
- Utility programs to pipe data across a RDMA-capable network ☆19 · Mar 14, 2026 · Updated 3 months ago
- Android demo for dabnn ☆20 · Oct 18, 2019 · Updated 6 years ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆78 · May 30, 2026 · Updated 2 weeks ago