pkusys / Rummy
GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.
☆30 Updated last year
Alternatives and similar repositories for Rummy
Users who are interested in Rummy are comparing it to the repositories listed below.
- PetPS: Supporting Huge Embedding Models with Tiered Memory ☆32 Updated last year
- A low-latency, billion-scale, and updatable graph-based vector store on SSD. ☆57 Updated 2 weeks ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25] ☆29 Updated 3 months ago
- ☆42 Updated 2 months ago
- This is the implementation repository of our OSDI'23 paper: SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory. ☆62 Updated 9 months ago
- Query-Adaptive Vector Search ☆47 Updated 2 months ago
- ☆24 Updated last year
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆60 Updated 9 months ago
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory" ☆28 Updated last year
- Vector search with bounded performance. ☆36 Updated last year
- FlashMob is a shared-memory random walk system. ☆32 Updated 2 years ago
- Code for "Baleen: ML Admission & Prefetching for Flash Caches" (FAST 2024). ☆26 Updated last year
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training ☆35 Updated 2 years ago
- A collection of awesome researchers and papers about disaggregated memory. ☆163 Updated 2 months ago
- Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL ☆59 Updated last year
- Deft: A Scalable Tree Index for Disaggregated Memory ☆19 Updated 4 months ago
- ☆44 Updated 3 weeks ago
- ☆36 Updated last year
- NEO is an LLM inference engine built to ease the GPU memory crisis through CPU offloading ☆52 Updated 2 months ago
- Artifacts for our ASPLOS'23 paper ElasticFlow ☆52 Updated last year
- ☆22 Updated last year
- ☆11 Updated last year
- ☆38 Updated last year
- ☆36 Updated 2 months ago
- A code base for Vexless ☆16 Updated last year
- ☆28 Updated last year
- This is the implementation repository of our SOSP'23 paper: Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System. ☆36 Updated last year
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value … ☆21 Updated 10 months ago
- [SIGMOD 2025] PQCache: Product Quantization-based KVCache for Long Context LLM Inference ☆66 Updated 2 months ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory ☆50 Updated last year