cjpais / whisperfileLinks
☆55Updated 9 months ago
Alternatives and similar repositories for whisperfile
Users that are interested in whisperfile are comparing it to the libraries listed below
Sorting:
- Editor with LLM generation tree exploration☆67Updated 3 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆54Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆47Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆54Updated last month
- Official Rust Implementation of Model2Vec☆108Updated this week
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆122Updated 6 months ago
- Extremely memory-efficient vector database☆68Updated 8 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆62Updated this week
- Light WebUI for lm.rs☆23Updated 7 months ago
- Port of Suno AI's Bark in C/C++ for fast inference☆51Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 9 months ago
- Heirarchical Navigable Small Worlds☆96Updated last month
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆46Updated 10 months ago
- A fork of llama3.c used to do some R&D on inferencing☆22Updated 5 months ago
- Web browser version of StarCoder.cpp☆44Updated last year
- Run GGML models with Kubernetes.☆172Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 4 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Embedding models from Jina AI☆60Updated last year
- llm plugin for Cerebras fast inference API☆26Updated 2 months ago
- Interpolate between embedding points with llm☆37Updated 10 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- llama.cpp gguf file parser for javascript☆42Updated 5 months ago
- A SQLite extension for generating text embeddings from GGUF models using llama.cpp☆190Updated 6 months ago
- The DPAB-α Benchmark☆23Updated 4 months ago
- Visual inference exploration & experimentation playground☆94Updated 6 months ago
- LLM plugin providing access to Mistral models using the Mistral API☆180Updated this week
- moondream in zig.☆69Updated this week