gotzmann / booster
Booster - open accelerator for LLM models. Better inference and debugging for AI hackers
☆155 Updated 8 months ago
Alternatives and similar repositories for booster
Users that are interested in booster are comparing it to the libraries listed below
- Binding to transformers in ggml ☆61 Updated this week
- Port of Facebook's LLaMA (Large Language Model Meta AI) in Golang with embedded C/C++ ☆167 Updated last year
- A simple vector database: Text encoding, semantic search, document storage ☆90 Updated last year
- A go wrapper around the rwkv.cpp library ☆20 Updated last year
- A lightweight proxy for filtering `<think>` tags from any OpenAI-compatible API endpoint. Designed for chain-of-thought language models t… ☆37 Updated 3 months ago
- ☆16 Updated last year
- ☆31 Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆31 Updated last year
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 8 months ago
- ☆38 Updated last year
- FastTensors - 100% Go framework for Neural Nets ☆51 Updated 7 months ago
- Local LLaMAs/Models in VSCode ☆53 Updated last year
- NLP transformers written in Go ☆230 Updated 2 years ago
- Llama 2 inference in one file of pure Go ☆104 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆123 Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆56 Updated last year
- GPT-2 small trained on phi-like data ☆66 Updated last year
- Full finetuning of large language models without large memory requirements ☆94 Updated last year
- Falcon LLM ggml framework with CPU and GPU support ☆246 Updated last year
- LLaVA server (llama.cpp). ☆180 Updated last year
- Memory is a long term memory for your own llm model ☆17 Updated last year
- Inference Llama 2 in Go ☆39 Updated last year
- The one who calls upon functions - Function-Calling Language Model ☆36 Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com. ☆115 Updated 11 months ago
- Distributed Inference for mlx LLm ☆91 Updated 9 months ago
- Neural Language Model for Go ☆59 Updated last year
- Inference Llama 2 in one file of pure go ☆16 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆135 Updated 10 months ago