ariG23498 / quantized-diffusion-inferenceLinks

Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs

☆38

Alternatives and similar repositories for quantized-diffusion-inference

Users that are interested in quantized-diffusion-inference are comparing it to the libraries listed below

Sorting:

ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆74Updated 3 weeks ago
ariG23498 / timm-wrapper-examples
Notebooks to demonstrate TimmWrapper
☆16Updated 6 months ago
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆112Updated 3 months ago
hkproj / multi-latent-attention
☆43Updated 2 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
kmohan321 / Research_Papers
☆46Updated 4 months ago
huggingface / competitions
☆124Updated 9 months ago
sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52Updated 8 months ago
ashishpatel26 / ai-tutor-rag-system
This is a repository for the course "From Beginner to LLM Developer" by Towards AI.
☆11Updated 7 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆82Updated last year
ariG23498 / mmdp
☆27Updated 3 weeks ago
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆48Updated last year
cornstarch-org / Cornstarch
☆99Updated 2 months ago
joey00072 / Multi-Head-Latent-Attention-MLA-
working implimention of deepseek MLA
☆42Updated 6 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117Updated 2 weeks ago
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆117Updated 6 months ago
fangyuan-ksgk / Mini-LLaVA
A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.
☆94Updated 7 months ago
alexander-moore / vlm
Composition of Multimodal Language Models From Scratch
☆15Updated 11 months ago
sayakpaul / you-dont-know-tensorflow
Contains materials for my talk "You don't know TensorFlow".
☆9Updated 2 years ago
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆207Updated last year
sayakpaul / nanoDiT
Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.
☆120Updated 2 months ago
thomwolf / sesame-explorations
☆29Updated 3 months ago
joaopauloschuler / less-parameters-llm
This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report
☆30Updated 5 months ago
devvrit / matformer
MatFormer repo
☆59Updated 7 months ago
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated last year
cityzen95 / LLM_from_scratch
Building LLMs from scratch following the book from S. Raschka
☆31Updated 4 months ago
julien-blanchon / arxflix
Arxflix turns your boring Arxiv research paper into a captivating video.
☆52Updated 2 months ago
facebookresearch / Mixture-of-Transformers
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.
☆88Updated 2 months ago
cosmo3769 / Quantized-LLMs
Quantization of LLMs and benchmarking.
☆10Updated last year
apple / ml-mofi
☆59Updated last year