spirobel / bunny-llama
Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh.
☆47 · Updated last year
Alternatives and similar repositories for bunny-llama:
Users who are interested in bunny-llama are comparing it to the libraries listed below.
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆43 · Updated 9 months ago
- Web browser version of StarCoder.cpp ☆43 · Updated last year
- ☆53 · Updated 6 months ago
- A library for incremental loading of large PyTorch checkpoints ☆56 · Updated last year
- Local Startup Advisor Chatbot ☆31 · Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆53 · Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 · Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆46 · Updated 6 months ago
- ☆31 · Updated last year
- GRDN.AI app for garden optimization ☆70 · Updated last year
- Light WebUI for lm.rs ☆23 · Updated 4 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore ☆39 · Updated last week
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 · Updated last year
- Tensor library for Zig ☆11 · Updated 3 months ago
- A fork of llama3.c used to do some R&D on inferencing ☆18 · Updated 2 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆15 · Updated last week
- ☆14 · Updated 2 months ago
- LLama implementations benchmarking framework ☆12 · Updated last year
- emoji_finder ☆15 · Updated last month
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual ☆22 · Updated last year
- ☆34 · Updated last year
- Editor with LLM generation tree exploration ☆62 · Updated last week
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 · Updated last year
- utilities for loading and running text embeddings with onnx ☆44 · Updated 6 months ago
- Port of Facebook's LLaMA model in C/C++ ☆32 · Updated 11 months ago
- ☆25 · Updated 2 months ago
- Port of Microsoft's BioGPT in C/C++ using ggml ☆87 · Updated 11 months ago
- ☆22 · Updated 8 months ago
- WebGPU LLM inference tuned by hand ☆148 · Updated last year