KioxiaAmerica / aisaq-diskann
All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search
☆55Updated 3 months ago
Alternatives and similar repositories for aisaq-diskann
Users that are interested in aisaq-diskann are comparing it to the libraries listed below
Sorting:
- InferX is a Inference Function as a Service Platform☆77Updated this week
- Lightweight Inference server for OpenVINO☆166Updated this week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust☆52Updated 3 weeks ago
- A simple tool to anonymize LLM prompts.☆61Updated 3 months ago
- GPU Power and Performance Manager☆58Updated 7 months ago
- Rust crates for XetHub☆43Updated 7 months ago
- Use LLMs to robustly extract structured data from HTML and markdown☆28Updated this week
- High-speed and easy-use LLM serving framework for local deployment☆104Updated 2 months ago
- Simple node proxy for llama-server that enables MCP use☆13Updated last week
- Run and manage MCP servers as Docker/Podman containers. Inspired by Docker compose.☆24Updated last week
- OLLama IMage CAtegorizer☆67Updated 4 months ago
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring on Apple M1/M2 and nVidia GPUs on Linux☆25Updated 3 weeks ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆46Updated this week
- Handy tool to measure the performance and efficiency of LLMs workloads.☆60Updated 3 weeks ago
- AI Tensor Engine for ROCm☆195Updated this week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆61Updated this week
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs.☆98Updated this week
- ☆28Updated last year
- Simple high-throughput inference library☆46Updated this week
- Connects MCP to major 3D printer APIs (Orca, Bambu, OctoPrint, Klipper, Duet, Repetier, Prusa, Creality). Control prints, monitor status,…☆56Updated last week
- Inference code for LLaMA models☆42Updated 2 years ago
- RAG based agent with chDB(ClickHouse)☆14Updated this week
- ☆42Updated 9 months ago
- FalkorDB-Browser is a visualization UI for FalkorDB.☆30Updated this week
- DIS: blockDevice over Immutable Storage☆65Updated 3 years ago
- A collection of FIO test cases and a Python script to run them.☆10Updated last year
- Public API documentation from dependencies for AI coding assistants☆35Updated 3 months ago
- build your own vector database -- the littlest hnsw☆58Updated 4 months ago
- AI aware proxy☆18Updated 7 months ago