pkusys / AuncelLinks
Vector search with bounded performance.
☆36Updated last year
Alternatives and similar repositories for Auncel
Users that are interested in Auncel are comparing it to the libraries listed below
Sorting:
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆74Updated 2 years ago
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Updated 2 years ago
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆25Updated 9 months ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆29Updated 3 months ago
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value …☆21Updated 10 months ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆32Updated last year
- This is the implementation repository of our OSDI'23 paper: SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory.☆62Updated 9 months ago
- ☆36Updated last year
- A Progam-Behavior-Guided Far Memory System☆35Updated last year
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"☆28Updated last year
- A Skew-Resistant Index for Processing-in-Memory☆25Updated 10 months ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling☆13Updated last year
- This is the implementation repository of our FAST'23 paper: FUSEE: A Fully Memory-Disaggregated Key-Value Store.☆59Updated 2 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆35Updated 2 years ago
- MemLiner is a remote-memory-friendly runtime system.☆31Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆42Updated 3 years ago
- Query-Adaptive Vector Search☆47Updated 2 months ago
- Deduplication over dis-aggregated memory for Serverless Computing☆14Updated 3 years ago
- Tigon: A Distributed Database for a CXL Pod [OSDI '25]☆28Updated 2 months ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆50Updated last year
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆55Updated 3 years ago
- Artifact evaluation repo for EuroSys'24.☆27Updated last year
- GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory.☆30Updated last year
- ☆28Updated last year
- This is the source code for our (Tobias Ziegler, Jacob Nelson-Slivon, Carsten Binnig and Viktor Leis) published paper at SIGMOD’23: Desig…☆27Updated 11 months ago
- ☆42Updated 2 months ago
- Efficient Compute-Communication Overlap for Distributed LLM Inference☆30Updated last month
- Virtual Memory Abstraction for Serverless Architectures☆48Updated 3 years ago
- Johnny Cache: the End of DRAM Cache Conflicts (in Tiered Main Memory Systems)☆19Updated 2 years ago
- The Artifact Evaluation Version of SOSP Paper #19☆50Updated last year