spirobel / bunny-llama
Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh.
☆50 · Updated 2 years ago
Alternatives and similar repositories for bunny-llama
Users who are interested in bunny-llama are comparing it to the libraries listed below.
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆44 · Updated last year
- ☆62 · Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆51 · Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆107 · Updated 2 years ago
- WebGPU LLM inference tuned by hand ☆151 · Updated 2 years ago
- A library for incremental loading of large PyTorch checkpoints ☆56 · Updated 2 years ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆141 · Updated last year
- Local Startup Advisor Chatbot ☆32 · Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 · Updated 2 years ago
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser? ☆109 · Updated 8 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆85 · Updated last year
- ☆31 · Updated 2 years ago
- JavaScript bindings for the ggml-js library ☆45 · Updated last month
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 · Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆58 · Updated last year
- Generates grammar files from TypeScript for LLM generation ☆38 · Updated last year
- ☆14 · Updated last year
- Web browser version of StarCoder.cpp ☆45 · Updated 2 years ago
- trying to make WebGPU a bit easier to use ☆18 · Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆55 · Updated 2 years ago
- tinygrad port of the RWKV large language model. ☆45 · Updated 9 months ago
- WebAssembly binding for Hora Approximate Nearest Neighbor Search Library ☆58 · Updated 4 years ago
- llama.cpp GGUF file parser for JavaScript ☆50 · Updated last year
- BlinkDL's RWKV-v4 running in the browser ☆47 · Updated 2 years ago
- Using Large Language Models for Repo-wide Type Prediction ☆112 · Updated 2 years ago
- Browse, search, and visualize ONNX models. ☆34 · Updated 7 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 · Updated 2 years ago
- utilities for loading and running text embeddings with onnx ☆44 · Updated 4 months ago
- Tensor library for machine learning ☆274 · Updated 2 years ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism ☆14 · Updated 2 years ago