spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆48Updated last year
Alternatives and similar repositories for bunny-llama:
Users that are interested in bunny-llama are comparing it to the libraries listed below
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated 11 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- WebGPU LLM inference tuned by hand☆149Updated last year
- Web browser version of StarCoder.cpp☆45Updated last year
- asynchronous/distributed speculative evaluation for llama3☆39Updated 8 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- ☆54Updated 8 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Editor with LLM generation tree exploration☆66Updated 2 months ago
- Light WebUI for lm.rs☆23Updated 6 months ago
- Port of Facebook's LLaMA model in C/C++☆32Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 9 months ago
- Tensor library for Zig☆12Updated 5 months ago
- Browse, search, and visualize ONNX models.☆24Updated this week
- LLama implementations benchmarking framework☆12Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆52Updated last year
- ☆14Updated 5 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆88Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- ☆22Updated 11 months ago
- tinygrad port of the RWKV large language model.☆44Updated last month
- ANE accelerated embedding models!☆16Updated 4 months ago
- ☆20Updated last month
- ☆35Updated 2 years ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆30Updated last year
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- LLaVA server (llama.cpp).☆180Updated last year
- ☆40Updated 2 years ago
- Local Startup Advisor Chatbot☆31Updated last year