spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆46Updated last year
Related projects ⓘ
Alternatives and complementary repositories for bunny-llama
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆43Updated 6 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- ☆21Updated 5 months ago
- LLama implementations benchmarking framework☆12Updated last year
- utilities for loading and running text embeddings with onnx☆39Updated 3 months ago
- GRDN.AI app for garden optimization☆69Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆51Updated 9 months ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?☆86Updated 6 months ago
- Web browser version of StarCoder.cpp☆43Updated last year
- ☆53Updated 3 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Updated 9 months ago
- Chrome Extension for exploring Hugging Face datasets 🔎☆49Updated 2 months ago
- Local Startup Advisor Chatbot☆26Updated 10 months ago
- Run `npm i -g socrate` to install a discussion room for using GPT personalities with internal monologues to debate problems. Provide a pr…☆27Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- Generates grammer files from typescript for LLM generation☆34Updated 9 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆57Updated 4 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 6 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆45Updated 3 months ago
- emoji_finder☆15Updated 2 months ago
- Latent Large Language Models☆16Updated 3 months ago
- Using modal.com to process FineWeb-edu data☆19Updated 2 months ago
- ☆25Updated 10 months ago
- ☆34Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated 5 months ago