ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated 6 months ago
Alternatives and similar repositories for quantized-diffusion-inference:
Users that are interested in quantized-diffusion-inference are comparing it to the libraries listed below
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- Notebooks for fine tuning pali gemma☆100Updated 3 weeks ago
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 5 months ago
- Collection of autoregressive model implementation☆85Updated last week
- Notebooks to demonstrate TimmWrapper☆16Updated 3 months ago
- ☆123Updated 6 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- ☆45Updated last month
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 months ago
- minimal GRPO implementation from scratch☆87Updated last month
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 4 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated 11 months ago
- ☆24Updated last week
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆49Updated last month
- working implimention of deepseek MLA☆40Updated 3 months ago
- Focused on fast experimentation and simplicity☆71Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- ☆16Updated last week
- Smart commit messages☆18Updated 6 months ago
- This repository contains the source code for the Saving 77% of the Parameters in Large Language Models Technical Report☆30Updated 2 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆108Updated 2 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated 6 months ago
- Quantization of LLMs and benchmarking.☆10Updated last year
- ☆58Updated last year
- ☆12Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.☆56Updated last month
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated this week
- Contains materials for my talk "You don't know TensorFlow".☆9Updated 2 years ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆58Updated 6 months ago