gotzmann / booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
☆147 · Updated 5 months ago
Alternatives and similar repositories for booster:
Users who are interested in booster are comparing it to the libraries listed below.
- Binding to transformers in ggml ☆60 · Updated 2 weeks ago
- ☆16 · Updated 9 months ago
- Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++ ☆163 · Updated last year
- A fast batching API to serve LLM models ☆180 · Updated 9 months ago
- The one who calls upon functions - Function-Calling Language Model ☆36 · Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- Something similar to Apple Intelligence? ☆58 · Updated 6 months ago
- 🐦 A open blazing-fast simple model gateway for rapid development of production GenAI apps ☆140 · Updated 5 months ago
- Neural Language Model for Go ☆58 · Updated last year
- ☆25 · Updated last week
- Inference Llama 2 in one file of pure go ☆16 · Updated last year
- ☆38 · Updated 10 months ago
- A simple vector database: Text encoding, semantic search, document storage ☆87 · Updated last year
- GPT-2 small trained on phi-like data ☆65 · Updated 11 months ago
- A go wrapper around the rwkv.cpp library ☆20 · Updated 10 months ago
- FastTensors - 100% Go framework for Neural Nets ☆44 · Updated 4 months ago
- RightHand - A GPT4 powered assistive tool. ☆107 · Updated 2 weeks ago
- 4 bits quantization of SantaCoder using GPTQ ☆53 · Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀 ☆74 · Updated last year
- A simple GUI utility for gathering LIMA-like chat data. ☆22 · Updated 2 months ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆49 · Updated 3 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second. ☆76 · Updated this week
- GGML implementation of BERT model with Python bindings and quantization. ☆53 · Updated 11 months ago
- Go module for fetching embeddings from embeddings providers ☆43 · Updated last month
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...) ☆103 · Updated 2 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆67 · Updated 4 months ago
- Llama 2 inference in one file of pure Go ☆105 · Updated last year
- ☆36 · Updated last month
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆48 · Updated 3 months ago
- Distributed Inference for mlx LLm ☆79 · Updated 5 months ago