gotzmann / booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
☆150Updated 6 months ago
Alternatives and similar repositories for booster:
Users that are interested in booster are comparing it to the libraries listed below
- Binding to transformers in ggml☆60Updated 3 weeks ago
- RightHand - A GPT4 powered assistive tool.☆108Updated last month
- ☆15Updated 9 months ago
- ☆39Updated last month
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆117Updated last year
- ☆31Updated last year
- ☆38Updated 11 months ago
- Llama 2 inference in one file of pure Go☆103Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆74Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆91Updated last year
- Local LLaMAs/Models in VSCode☆52Updated last year
- Something similar to Apple Intelligence?☆59Updated 7 months ago
- ZenModel is a framework for building LLM applications with agentic workflow☆63Updated 3 months ago
- Go bindings for HuggingFace Tokenizer☆108Updated 3 months ago
- 🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps☆141Updated 6 months ago
- Visual Studio Code extension for WizardCoder☆145Updated last year
- GPT-2 small trained on phi-like data☆65Updated 11 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆46Updated 6 months ago
- FastTensors - 100% Go framework for Neural Nets☆45Updated 4 months ago
- A simple vector database: Text encoding, semantic search, document storage☆87Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆23Updated last month
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Discord chatbot interface to train an LLM on user message history☆27Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆53Updated 11 months ago
- A simple GUI utility for gathering LIMA-like chat data.☆22Updated 3 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆52Updated this week
- ASR + diarization model server with speculative decoding☆54Updated 8 months ago