brittlewis12 / autogguf
Easily convert HuggingFace models to GGUF-format for llama.cpp
☆21 Updated 7 months ago
Alternatives and similar repositories for autogguf:
Users who are interested in autogguf are comparing it to the libraries listed below
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- LLM backed Fantasy Tribe Game ☆18 Updated 3 months ago
- ☆27 Updated last year
- run ollama & gguf easily with a single command ☆49 Updated 9 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated 11 months ago
- Modified Beam Search with periodical restart ☆12 Updated 5 months ago
- Experimental sampler to make LLMs more creative ☆30 Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs ☆50 Updated last month
- A repository to store helpful information and emerging insights in regard to LLMs ☆20 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated 11 months ago
- Mistral7B playing DOOM ☆28 Updated 11 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 Updated 2 months ago
- ☆21 Updated 4 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆31 Updated 7 months ago
- An unsupervised model merging algorithm for Transformers-based language models. ☆106 Updated 10 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI ☆56 Updated 3 months ago
- Text generation in Python, as easy as possible ☆54 Updated this week
- Yet Another (LLM) Web UI, made with Gemini ☆11 Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆21 Updated 3 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆86 Updated 6 months ago
- entropix style sampling + GUI ☆25 Updated 4 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆21 Updated 2 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 Updated 9 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2. ☆25 Updated last year
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆10 Updated 3 months ago
- Build HTML artefacts with Ollama