ariG23498 / quantized-diffusion-inference
Notebooks and scripts that showcase running quantized diffusion models on consumer GPUs
☆36 Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for quantized-diffusion-inference
- Collection of autoregressive model implementations ☆67 Updated this week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆18 Updated 3 months ago
- Set of scripts to finetune LLMs ☆36 Updated 7 months ago
- ☆115 Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- ☆45 Updated 2 months ago
- Quantization of LLMs and benchmarking. ☆10 Updated 7 months ago
- Recaption large (Web)Datasets with vLLM and save the artifacts. ☆30 Updated last month
- ☆40 Updated 2 weeks ago
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers. ☆29 Updated 2 years ago
- ☆40 Updated 6 months ago
- Contains materials for my talk "You don't know TensorFlow". ☆9 Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 2 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆50 Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 Updated 7 months ago
- Notebooks for fine-tuning PaliGemma ☆41 Updated 3 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- Efficient CUDA kernels for training convolutional neural networks with PyTorch. ☆35 Updated 3 weeks ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B… ☆81 Updated 2 months ago
- DPO, but faster 🚀 ☆23 Updated 3 weeks ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks ☆33 Updated 5 months ago
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU ☆30 Updated last year
- ☆58 Updated 8 months ago
- End-to-End LLM Guide ☆97 Updated 4 months ago
- Prune transformer layers ☆64 Updated 5 months ago
- ☆24 Updated last year
- I learn about and explain quantization ☆25 Updated 7 months ago
- JAX Implementation of Black Forest Labs' Flux.1 family of models ☆14 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago