jukofyork / control-vectors
Generates control vectors for use with llama.cpp in GGUF format.
☆16 Updated 4 months ago
Alternatives and similar repositories for control-vectors:
Users who are interested in control-vectors are comparing it to the libraries listed below.
- ☆27 Updated last year
- entropix style sampling + GUI ☆25 Updated 2 months ago
- Model REVOLVER, a human in the loop model mixing system. ☆33 Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆35 Updated this week
- An unsupervised model merging algorithm for Transformers-based language models. ☆101 Updated 8 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆42 Updated 9 months ago
- Train Llama Loras Easily ☆30 Updated last year
- Based on kylemcdonald/i2i-realtime. The warping server for GenDJ real time webcam AI warping ☆26 Updated 5 months ago
- Experimental sampler to make LLMs more creative ☆30 Updated last year
- Interact with a AI Game-engine that keep building its rules and world as you play, adapted to your gameplay. ☆43 Updated 6 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆29 Updated 5 months ago
- All the world is a play, we are but actors in it. ☆47 Updated this week
- Docker image for the Text Generation Web UI: A Gradio web UI for Large Language Models. Supports Transformers, AWQ, GPTQ, llama.cpp (GGUF… ☆1 Updated 5 months ago
- A simple framework for using a local Koboldcpp LLM to help with story-writing ☆19 Updated last year
- ☆22 Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆23 Updated 11 months ago
- ☆22 Updated 2 months ago
- An Extension for oobabooga/text-generation-webui ☆36 Updated last year
- Easily convert HuggingFace models to GGUF-format for llama.cpp ☆21 Updated 5 months ago
- A repository to store helpful information and emerging insights in regard to LLMs ☆20 Updated last year
- cli tool to quantize gguf, gptq, awq, hqq and exl2 models ☆65 Updated 3 weeks ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated 4 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI ☆40 Updated last month
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated 7 months ago
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 4 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- finetune your florence2 model easy ☆20 Updated 5 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated 9 months ago
- win32 native frontend for llama-cli ☆12 Updated 2 months ago