xhedit / quantkit
cli tool to quantize gguf, gptq, awq, hqq and exl2 models
☆59 Updated this week
Related projects:
- ☆101 Updated 6 months ago
- Low-Rank adapter extraction for fine-tuned transformers model ☆154 Updated 4 months ago
- Easily view and modify JSON datasets for large language models ☆55 Updated this week
- ☆144 Updated 2 months ago
- A python package for developing AI applications with local LLMs. ☆137 Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆84 Updated 2 months ago
- All the world is a play, we are but actors in it. ☆46 Updated 2 months ago
- ☆50 Updated 3 months ago
- A pipeline parallel training script for LLMs. ☆79 Updated last month
- Scripts to create your own moe models using mlx ☆86 Updated 6 months ago
- ☆64 Updated 3 months ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆96 Updated 4 months ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 Updated last year
- ☆28 Updated this week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated last month
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆44 Updated last month
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆77 Updated last month
- Something similar to Apple Intelligence? ☆54 Updated 2 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3 ☆33 Updated last week
- Let's create synthetic textbooks together :) ☆70 Updated 7 months ago
- automatically quant GGUF models ☆119 Updated this week
- Experimental LLM Inference UX to aid in creative writing ☆81 Updated 2 months ago
- For inferring and serving local LLMs using the MLX framework ☆77 Updated 5 months ago
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated 3 months ago
- ☆26 Updated last year
- Formatron empowers everyone to control the format of language models' output with minimal overhead. ☆116 Updated last week
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆85 Updated this week
- ☆71 Updated last year
- ☆23 Updated last month
- Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for system… ☆32 Updated last month