brittlewis12 / autogguf
Easily convert Hugging Face models to GGUF format for llama.cpp
☆21Updated 11 months ago
Alternatives and similar repositories for autogguf
Users that are interested in autogguf are comparing it to the libraries listed below
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆31Updated 2 months ago
- ☆27Updated last year
- run ollama & gguf easily with a single command☆51Updated last year
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 6 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 3 months ago
- B-Llama3o: a Llama 3 model with vision and audio understanding, as well as text, audio, and animation data output.☆26Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Easy-to-use, high-performance knowledge distillation for LLMs☆86Updated last month
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆36Updated 11 months ago
- Python package wrapping llama.cpp for on-device LLM inference☆72Updated this week
- ☆28Updated 10 months ago
- AirLLM 70B inference with single 4GB GPU☆14Updated this week
- ☆53Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 8 months ago
- Gradio-based tool to run open-source LLMs directly from Hugging Face☆93Updated last year
- ☆115Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- ☆18Updated last year
- ☆24Updated 5 months ago
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆25Updated last month
- ☆23Updated 8 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆31Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 5 months ago
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week