turbopuffer / tpuf-benchmark
General purpose benchmarking tool for turbopuffer deployments
☆15Updated last week
Alternatives and similar repositories for tpuf-benchmark
Users who are interested in tpuf-benchmark compare it to the libraries listed below.
- Official Python API client library for turbopuffer☆67Updated last week
- Tantivy directory implementation backed by object_store☆36Updated last year
- The indexing service for ScyllaDB for vector searching functionality☆21Updated this week
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆20Updated 2 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated 2 years ago
- Securely run AI-generated code in stateful sandboxes that run forever.☆217Updated 3 months ago
- Fast block-level file diffs (e.g. for VM disk images) using CoW filesystem metadata☆136Updated last month
- xet client tech, used in huggingface_hub☆157Updated last week
- JAX infrastructure for model optimisation☆50Updated 2 weeks ago
- This repo tracks the opened and merged PRs by the top SWE coding agents by OpenAI, GitHub, and others. Updates every 3 hours.☆238Updated this week
- Official Rust Implementation of Model2Vec☆124Updated last month
- ☆137Updated last year
- Modal SDK for JavaScript/TS and Go [Alpha]☆39Updated this week
- build your own vector database -- the littlest hnsw☆61Updated 7 months ago
- HNSW implementation in Rust. Reference: https://arxiv.org/ftp/arxiv/papers/1603/1603.09320.pdf☆232Updated 8 months ago
- Structured LLM APIs☆156Updated last year
- SIMD quantization kernels☆79Updated this week
- Super-fast Structured Outputs☆417Updated this week
- Ask questions, let GPT do the SQL.☆132Updated 2 years ago
- ☆47Updated 2 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆28Updated 4 months ago
- ☆112Updated last year
- High-Performance Implementation of OpenAI's TikToken.☆444Updated last month
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- High-availability network proxy / VPN server, powered by WireGuard☆170Updated 6 months ago
- Tiny inference-only implementation of LLaMA☆93Updated last year
- High-performance key-value store for ML inference. 100x faster than Redis.☆221Updated last year
- Rust Implementation of micrograd☆52Updated last year
- An HTTP serving framework by Banana☆102Updated last year
- Infraless Database over any s3 storage API.☆20Updated last year