xhedit / quantkit
CLI tool to quantize GGUF, GPTQ, AWQ, HQQ and EXL2 models
☆68 · Updated last month
Alternatives and similar repositories for quantkit:
Users who are interested in quantkit are comparing it to the repositories listed below.
- An unsupervised model merging algorithm for Transformers-based language models. ☆104 · Updated 9 months ago
- A pipeline parallel training script for LLMs. ☆122 · Updated 2 weeks ago
- Easily view and modify JSON datasets for large language models ☆70 · Updated this week
- ☆111 · Updated last month
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities. ☆96 · Updated this week
- A Python package for developing AI applications with local LLMs. ☆145 · Updated last month
- entropix style sampling + GUI ☆25 · Updated 3 months ago
- Scripts to create your own MoE models using MLX ☆86 · Updated 11 months ago
- ☆65 · Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆67 · Updated 4 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆232 · Updated 8 months ago
- All the world is a play, we are but actors in it. ☆47 · Updated this week
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆85 · Updated 5 months ago
- Low-Rank adapter extraction for fine-tuned transformers models ☆169 · Updated 9 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 · Updated 2 months ago
- ☆52 · Updated 8 months ago
- ☆152 · Updated 7 months ago
- Gradio based tool to run open-source LLM models directly from Hugging Face ☆90 · Updated 7 months ago
- Simple examples using Argilla tools to build AI ☆53 · Updated 2 months ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 · Updated last year
- Easy to use, high-performance knowledge distillation for LLMs ☆45 · Updated last month
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 · Updated 5 months ago
- Let's create synthetic textbooks together :) ☆73 · Updated last year
- Automatically quantize GGUF models ☆154 · Updated this week
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆217 · Updated 9 months ago
- run ollama & gguf easily with a single command ☆49 · Updated 9 months ago
- Distributed inference for MLX LLMs ☆82 · Updated 6 months ago
- ☆23 · Updated 2 months ago
- Very basic framework for parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture … ☆37 · Updated 2 weeks ago