gotzmann / booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
☆154 Updated 7 months ago
Alternatives and similar repositories for booster:
Users that are interested in booster are comparing it to the libraries listed below
- Binding to transformers in ggml ☆60 Updated this week
- Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++ ☆167 Updated last year
- Llama 2 inference in one file of pure Go ☆104 Updated last year
- FastTensors - 100% Go framework for Neural Nets ☆49 Updated 6 months ago
- NLP transformers written in Go ☆227 Updated 2 years ago
- RightHand - A GPT4 powered assistive tool. ☆109 Updated 2 months ago
- ☆16 Updated 10 months ago
- ZenModel is a framework for building LLM applications with agentic workflow ☆65 Updated 5 months ago
- Go bindings for HuggingFace Tokenizer ☆126 Updated this week
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 6 months ago
- Inference Llama 2 in Go ☆39 Updated last year
- Go client for txtai ☆76 Updated 2 weeks ago
- Neural Language Model for Go ☆59 Updated last year
- A simple GUI utility for gathering LIMA-like chat data. ☆23 Updated 3 weeks ago
- Visual Studio Code extension for WizardCoder ☆147 Updated last year
- A simple vector database: Text encoding, semantic search, document storage ☆88 Updated last year
- 4 bits quantization of SantaCoder using GPTQ ☆51 Updated last year
- ☆38 Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆122 Updated last year
- Turn natural language into commands. Your CLI tasks, now as easy as a conversation. Run it 100% offline, or use OpenAI's models. ☆56 Updated 9 months ago
- Falcon LLM ggml framework with CPU and GPU support ☆246 Updated last year
- Production grade LLM-ops in Golang ☆55 Updated this week
- 🐦 An open, blazing-fast, simple model gateway for rapid development of production GenAI apps ☆142 Updated 7 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆55 Updated last month
- GPT-2 small trained on phi-like data ☆65 Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆65 Updated last year
- Conveniently download files, models, tokenizers from HuggingFace Hub ☆18 Updated last month
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆44 Updated 10 months ago
- DSPy Go implementation ☆27 Updated this week
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆147 Updated last year