chu-tianxiang / exl2-for-all
EXL2 quantization generalized to other models.
☆10Updated last year
Alternatives and similar repositories for exl2-for-all:
Users that are interested in exl2-for-all are comparing it to the libraries listed below
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated 11 months ago
- ☆27Updated last year
- Loader extension for tabbyAPI in SillyTavern☆25Updated 8 months ago
- Train Llama Loras Easily☆31Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 11 months ago
- My personal fork of koboldcpp where I hack in experimental samplers.☆44Updated 10 months ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆33Updated last year
- Interact with a AI Game-engine that keep building its rules and world as you play, adapted to your gameplay.☆42Updated 9 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- ☆9Updated 10 months ago
- Easily view and modify JSON datasets for large language models☆72Updated last month
- Text WebUI extension to add clever Notebooks to Chat mode☆139Updated last year
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models☆36Updated last year
- 8-bit CUDA functions for PyTorch☆25Updated last year
- An Extension for oobabooga/text-generation-webui☆36Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- This project is established for real-time training of the RWKV model.☆49Updated 10 months ago
- A pipeline parallel training script for LLMs.☆136Updated last week
- ☆53Updated 10 months ago
- Efficient 3bit/4bit quantization of LLaMA models☆19Updated last year
- LLM inference in C/C++☆20Updated 2 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- entropix style sampling + GUI☆25Updated 5 months ago
- Traing PRO extension for oobabooga WebUI - recent dev version☆48Updated 2 months ago
- Make abliterated models with transformers, easy and fast☆64Updated 2 weeks ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 7 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 9 months ago
- Genertaes control vectors for use with llama.cpp in GGUF format.☆19Updated 2 weeks ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆32Updated 8 months ago