jbarrow / tinyhnswLinks
build your own vector database -- the littlest hnsw
☆61Updated 6 months ago
Alternatives and similar repositories for tinyhnsw
Users that are interested in tinyhnsw are comparing it to the libraries listed below
Sorting:
- Extremely memory-efficient vector database☆70Updated 9 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Heirarchical Navigable Small Worlds☆97Updated 3 months ago
- Official Rust Implementation of Model2Vec☆122Updated last week
- A GPU Accelerated Binary Vector Store☆47Updated 4 months ago
- Vector Database with support for late interaction and token level embeddings.☆55Updated 3 weeks ago
- ☆57Updated 10 months ago
- HNSW tutorial☆141Updated last year
- Analyzing hacker news in real-time with Bytewax and Proton☆39Updated last year
- Exploration of Vector database Index for fast approximate nearest neighbour search.☆28Updated 11 months ago
- Generate BM25 sparse vector inside PostgreSQL☆81Updated 8 months ago
- Flowchart-like UI to interconnect LLM's and Huggingface models, and deploy them as a REST API with little to no code.☆72Updated 3 months ago
- ☆59Updated 3 months ago
- ☆27Updated 10 months ago
- ☆363Updated this week
- utilities for loading and running text embeddings with onnx☆44Updated 11 months ago
- High-Performance K-Means Clustering Library☆28Updated last week
- Official Python API client library for turbopuffer☆59Updated this week
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆187Updated 3 weeks ago
- Structured Output Is All You Need!☆57Updated last year
- My CUDA solution to the 1BRC☆10Updated last year
- llama.cpp gguf file parser for javascript☆43Updated 7 months ago
- A tiny version of GPT fully implemented in Python with zero dependencies☆72Updated 7 months ago
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆129Updated 8 months ago
- LLM-Powered Analyses of your GitHub Community using EvaDB☆24Updated last year
- Pivotal Token Search☆109Updated last week
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 3 months ago
- Augment Swarm with durable execution to help you build reliable and scalable multi-agent systems.☆100Updated 8 months ago
- LLama implementations benchmarking framework☆12Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆137Updated 2 months ago