spirobel / bunny-llama

iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh

☆50

Alternatives and similar repositories for bunny-llama

Users who are interested in bunny-llama compare it to the libraries listed below.

FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66 Updated last year
distantmagic / structured
Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp
☆45 Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151 Updated 2 years ago
IntrinsicLabsAI / grammar-builder
Generates grammar files from TypeScript for LLM generation
☆38 Updated last year
IntrinsicLabsAI / gbnfgen
TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces
☆139 Updated last year
FL33TW00D / laserbeak
Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU
☆102 Updated 2 years ago
cjpais / whisperfile
☆60 Updated 11 months ago
ahoylabs / gguf.js
A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files.
☆48 Updated last year
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56 Updated 2 years ago
remyxai / LocalMentor
Local Startup Advisor Chatbot
☆31 Updated last year
cztomsik / ggml-js
JavaScript bindings for the ggml library
☆43 Updated 4 months ago
blackhole89 / autopen
Editor with LLM generation tree exploration
☆73 Updated 5 months ago
adrienbrault / json-schema-to-gbnf
Converts JSON-Schema to GBNF grammar to use with llama.cpp
☆55 Updated last year
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with ONNX
☆44 Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45 Updated last year
jasonjmcghee / portable-hnsw
What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser?
☆106 Updated 3 months ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆56 Updated last year
PABannier / biogpt.cpp
Port of Microsoft's BioGPT in C/C++ using ggml
☆87 Updated last year
muna-ai / muna-py
Run AI models anywhere. https://muna.ai/explore
☆63 Updated last week
rahuldshetty / starcoder.js
Web browser version of StarCoder.cpp
☆45 Updated 2 years ago
4dh / GRDN
GRDN.AI app for garden optimization
☆70 Updated last year
asg017 / sqlite-rembed
A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)
☆132 Updated 9 months ago
josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆47 Updated 2 years ago
xenova / model-explorer
Browse, search, and visualize ONNX models.
☆33 Updated 3 months ago
lsb / sqlite-vector-search
☆31 Updated last year
GammaTauAI / opentau
Using Large Language Models for Repo-wide Type Prediction
☆111 Updated last year
bwasti / brr.js
trying to make WebGPU a bit easier to use
☆16 Updated last year
hora-search / hora-wasm
WebAssembly binding for Hora Approximate Nearest Neighbor Search Library
☆58 Updated 3 years ago
charlesfrye / cuda-substrings
Because it's there.
☆16 Updated 10 months ago