spirobel / bunny-llamaLinks
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆49Updated last year
Alternatives and similar repositories for bunny-llama
Users that are interested in bunny-llama are comparing it to the libraries listed below
Sorting:
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Local Startup Advisor Chatbot☆31Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- Light WebUI for lm.rs☆23Updated 8 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆64Updated this week
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 10 months ago
- ANE accelerated embedding models!☆18Updated 6 months ago
- llama.cpp gguf file parser for javascript☆42Updated 6 months ago
- LLama implementations benchmarking framework☆12Updated last year
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆30Updated last year
- Web browser version of StarCoder.cpp☆45Updated last year
- ☆56Updated 10 months ago
- Browse, search, and visualize ONNX models.☆32Updated last month
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- Generates grammer files from typescript for LLM generation☆38Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆33Updated 2 years ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated last year
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated last year
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- moondream in zig.☆71Updated 3 weeks ago
- Repair incomplete JSON (e.g. from streaming APIs or AI models) so it can be parsed as it's received.☆34Updated last year
- ☆31Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated 2 years ago
- ☆35Updated 2 years ago
- Run Llama 2 using MLX on macOS☆34Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated last year
- ☆40Updated 2 years ago