kioxia-jp / aisaq-diskannLinks
All-in-Storage Solution based on DiskANN for DRAM-free Approximate Nearest Neighbor Search
☆80Updated 3 months ago
Alternatives and similar repositories for aisaq-diskann
Users that are interested in aisaq-diskann are comparing it to the libraries listed below
Sorting:
- InferX: Inference as a Service Platform☆136Updated last week
- DCPerf benchmark suite for hyperscale cloud applications☆210Updated this week
- No-code CLI designed for accelerating ONNX workflows☆214Updated 4 months ago
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs☆26Updated 2 months ago
- PacketMill: Toward per-core 100-Gbps Networking☆62Updated 3 years ago
- Inference code for LLaMA models☆42Updated 2 years ago
- Lightweight daemon for monitoring CUDA runtime API calls with eBPF uprobes☆131Updated 6 months ago
- ☆64Updated last year
- This is a landscape of the infrastructure that powers the generative AI ecosystem☆149Updated last year
- ☆14Updated last year
- Rust crates for XetHub☆70Updated last year
- CUDA checkpoint and restore utility☆376Updated last month
- DIS: blockDevice over Immutable Storage☆69Updated 3 years ago
- Build userspace NVMe drivers and storage applications with CUDA support☆393Updated last year
- High-performance safetensors model loader☆63Updated 3 months ago
- ☆39Updated this week
- Running SXM2/SXM3/SXM4 NVidia data center GPUs in consumer PCs☆126Updated 2 years ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS over OpenAI endpoints.☆213Updated last week
- Bamboo-7B Large Language Model☆93Updated last year
- The fastest ACID-transactional persisted Key-Value store designed as modified LSM-Tree for NVMe block-devices with GPU-acceleration and S…☆75Updated 2 years ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.☆80Updated last week
- NVIDIA Linux open GPU with P2P support☆60Updated last week
- AI/GPU flame graph☆188Updated last week
- Systematic and comprehensive benchmarks for LLM systems.☆38Updated 2 weeks ago
- Fast block-level file diffs (e.g. for VM disk images) using CoW filesystem metadata☆214Updated 3 months ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆71Updated this week
- ☆149Updated last week
- xet client tech, used in huggingface_hub☆297Updated last week
- NVIDIA GPUDirect Storage Driver☆292Updated 2 months ago
- Vector Database with support for late interaction and token level embeddings.☆55Updated 4 months ago