ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆38Updated 5 months ago
Alternatives and similar repositories for quantized-diffusion-inference:
Users that are interested in quantized-diffusion-inference are comparing it to the libraries listed below
- Recaption large (Web)Datasets with vllm and save the artifacts.☆50Updated 4 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 3 months ago
- Collection of autoregressive model implementation☆85Updated last month
- Quantization of LLMs and benchmarking.☆10Updated last year
- Notebooks for fine tuning pali gemma☆98Updated 3 months ago
- Train, tune, and infer Bamba model☆88Updated 2 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 2 months ago
- Contains materials for my talk "You don't know TensorFlow".☆9Updated 2 years ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆46Updated last month
- ☆45Updated last week
- ☆12Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆18Updated this week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 2 weeks ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆107Updated last month
- Cray-LM unified training and inference stack.☆22Updated 2 months ago
- Arxflix turns your boring Arxiv research paper into a captivating video.☆46Updated 4 months ago
- DPO, but faster 🚀☆40Updated 4 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 6 months ago
- ☆122Updated 5 months ago
- ☆47Updated 7 months ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 2 years ago
- ☆58Updated last year
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated last week
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆27Updated last month