pkusys / AuncelLinks
Vector search with bounded performance.
☆36Updated last year
Alternatives and similar repositories for Auncel
Users that are interested in Auncel are comparing it to the libraries listed below
Sorting:
- Reading seminar in Harvard Cloud Networking and Systems Group☆16Updated 2 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆73Updated 2 years ago
- A Progam-Behavior-Guided Far Memory System☆35Updated last year
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"☆28Updated last year
- Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]☆25Updated 8 months ago
- This is the implementation repository of our FAST'23 paper: FUSEE: A Fully Memory-Disaggregated Key-Value Store.☆59Updated 2 years ago
- This is the implementation repository of our OSDI'23 paper: SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory.☆62Updated 9 months ago
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value …☆21Updated 9 months ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆32Updated last year
- Johnny Cache: the End of DRAM Cache Conflicts (in Tiered Main Memory Systems)☆18Updated 2 years ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling☆13Updated last year
- ☆36Updated last year
- Virtual Memory Abstraction for Serverless Architectures☆48Updated 3 years ago
- Efficient Compute-Communication Overlap for Distributed LLM Inference☆26Updated last month
- A Skew-Resistant Index for Processing-in-Memory☆25Updated 10 months ago
- A Memory-Disaggregated Managed Runtime.☆66Updated 3 years ago
- website for systems seminar at UIUC☆20Updated last month
- Tigon: A Distributed Database for a CXL Pod [OSDI '25]☆28Updated last month
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆54Updated 3 years ago
- MemLiner is a remote-memory-friendly runtime system.☆31Updated 2 years ago
- Medusa: Accelerating Serverless LLM Inference with Materialization [ASPLOS'25]☆28Updated 2 months ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆42Updated 3 years ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆49Updated last year
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆19Updated 4 months ago
- A preemptive scheduling framework for diverse XPUs, including GPUs, NPUs, ASICs, and FPGAs☆74Updated last week
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆35Updated 2 years ago
- Implementation of the logging layer of our SOSP '23 paper Halfmoon☆11Updated 2 years ago
- Artifact evaluation repo for EuroSys'24.☆27Updated last year
- The Artifact Evaluation Version of SOSP Paper #19☆50Updated 11 months ago
- MeshInsight: Dissecting Overheads of Service Mesh Sidecars☆47Updated last year