cjpais / whisperfile
☆53 Updated 3 months ago
Related projects
Alternatives and complementary repositories for whisperfile
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh ☆46 Updated last year
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆43 Updated 6 months ago
- Local Startup Advisor Chatbot ☆26 Updated 10 months ago
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...) ☆86 Updated 2 weeks ago
- A SQLite extension for generating text embeddings from GGUF models using llama.cpp ☆130 Updated last month
- GGML implementation of BERT model with Python bindings and quantization. ☆51 Updated 9 months ago
- Extremely memory-efficient vector database ☆57 Updated 2 months ago
- Web browser version of StarCoder.cpp ☆43 Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference ☆54 Updated 7 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge ☆36 Updated 9 months ago
- Light WebUI for lm.rs ☆22 Updated last month
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆47 Updated last month
- HTTP proxy for on-demand model loading with llama.cpp (or other OpenAI compatible backends) ☆41 Updated this week
- What if an HNSW index was just a file, and you could serve it from a CDN, and search it directly in the browser? ☆85 Updated 6 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆63 Updated 11 months ago
- Distribute and run llamafile/LLMs with a single docker image. ☆64 Updated 2 weeks ago
- AnyModal is a Flexible Multimodal Language Model Framework ☆40 Updated this week
- Embedding models from Jina AI ☆56 Updated 10 months ago
- Something similar to Apple Intelligence? ☆57 Updated 4 months ago
- Rust implementation of Surya ☆51 Updated last month
- A live multiplayer trivia game where users can bid for the subject of the next question ☆22 Updated 2 weeks ago
- Visual inference exploration & experimentation playground ☆76 Updated this week
- Port of Microsoft's BioGPT in C/C++ using ggml ☆87 Updated 9 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆12 Updated 6 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- Fast, SQL powered, in-process vector search for any language with an SQLite driver ☆268 Updated 2 weeks ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆44 Updated 5 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆131 Updated 4 months ago