ariG23498 / quantized-diffusion-inferenceLinks
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated last year
Alternatives and similar repositories for quantized-diffusion-inference
Users that are interested in quantized-diffusion-inference are comparing it to the libraries listed below
Sorting:
- Notebooks for fine tuning pali gemma☆117Updated 9 months ago
- Fine tune Gemma 3 on an object detection task☆97Updated 6 months ago
- ☆46Updated 8 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated last year
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- Collection of autoregressive model implementation☆85Updated 3 weeks ago
- ☆34Updated 7 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated last year
- ☆115Updated 5 months ago
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- ☆125Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆125Updated 6 months ago
- ☆46Updated 10 months ago
- LoRA and DoRA from Scratch Implementations☆215Updated last year
- ☆30Updated 9 months ago
- MatFormer repo☆70Updated last year
- Set of scripts to finetune LLMs☆38Updated last year
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆84Updated 2 months ago
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆14Updated 5 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆98Updated last year
- Quantization of LLMs and benchmarking.☆10Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Building LLMs from scratch following the book from S. Raschka☆32Updated 10 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- ☆14Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- Google TPU optimizations for transformers models☆134Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning.☆46Updated last week
- ☆59Updated last year