ariG23498 / quantized-diffusion-inference
Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs
☆33Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for quantized-diffusion-inference
- Recaption large (Web)Datasets with vllm and save the artifacts.☆30Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Collection of autoregressive model implementation☆66Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0☆22Updated last week
- Contains materials for my talk "You don't know TensorFlow".☆9Updated last year
- Set of scripts to finetune LLMs☆36Updated 7 months ago
- ☆57Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- ☆21Updated last week
- Implementation of BitNet-1.58 instruct tuning☆18Updated 6 months ago
- ☆115Updated 2 weeks ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆52Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆26Updated 5 months ago
- ☆62Updated last month
- ☆40Updated this week
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R…☆40Updated last month
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆49Updated 7 months ago
- Notebooks for fine tuning pali gemma☆41Updated 3 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago
- (WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, B…☆81Updated 2 months ago
- Quantization of LLMs and benchmarking.☆10Updated 7 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆49Updated last week
- Code for NeurIPS LLM Efficiency Challenge☆53Updated 7 months ago
- MEXMA: Token-level objectives improve sentence representations☆32Updated this week
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆30Updated last year
- ☆44Updated 2 months ago
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- 🍳 AyaMCooking is a Voice-to-Voice Mutli-lingual RAG Agent that makes a perfect sous chef for your kitchen, in upto 10 Languages 🤌🧑🍳☆17Updated 2 weeks ago
- ☆24Updated last year