spirobel / bunny-llamaLinks
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆50Updated last year
Alternatives and similar repositories for bunny-llama
Users that are interested in bunny-llama are comparing it to the libraries listed below
Sorting:
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- WebGPU LLM inference tuned by hand☆151Updated 2 years ago
- ☆57Updated 10 months ago
- LLama implementations benchmarking framework☆12Updated last year
- Run Llama 2 using MLX on macOS☆34Updated last year
- llama.cpp gguf file parser for javascript☆43Updated 7 months ago
- ☆31Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- asynchronous/distributed speculative evaluation for llama3☆39Updated 11 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated last year
- Editor with LLM generation tree exploration☆71Updated 5 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- Browse, search, and visualize ONNX models.☆32Updated 2 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- webassembly binding for Hora Approximate Nearest Neighbor Search Library☆58Updated 3 years ago
- Local Startup Advisor Chatbot☆31Updated last year
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore☆64Updated last week
- Web browser version of StarCoder.cpp☆45Updated last year
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆106Updated 3 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆129Updated 8 months ago
- Light WebUI for lm.rs☆24Updated 8 months ago
- moondream in zig.☆73Updated last month
- ☆31Updated last year
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆22Updated 3 months ago