lynnliu030 / artifact-eval
☆7Updated 2 months ago
Alternatives and similar repositories for artifact-eval
Users who are interested in artifact-eval are comparing it to the repositories listed below
- ☆12Updated 5 months ago
- Query-Adaptive Vector Search☆35Updated 3 weeks ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆21Updated last year
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆54Updated last week
- ☆47Updated last year
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat…☆94Updated 7 months ago
- Version of PBBS Benchmarks for VLDB 2024 Reviewers☆9Updated 2 years ago
- ☆32Updated last year
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆18Updated 3 months ago
- ☆28Updated 4 months ago
- ☆13Updated 2 months ago
- [ICDE 2024] VDTuner - Automated Performance Tuning for Vector Data Management Systems (Vector Databases)☆28Updated last year
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆79Updated 3 months ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…☆20Updated last year
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024)☆16Updated last year
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆29Updated last month
- Stateful LLM Serving☆73Updated 3 months ago
- ☆121Updated 5 months ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆164Updated 9 months ago
- DB-BERT tunes database systems for optimal performance, using tuning hints mined from text.☆61Updated last year
- ⚡ Faster vector search with PDX: A vertical data layout for vectors☆39Updated 2 weeks ago
- ☆67Updated 8 months ago
- A resilient distributed training framework☆95Updated last year
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆116Updated 6 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆28Updated 4 months ago
- An experimentation platform for LLM inference optimisation☆31Updated 9 months ago
- A lightweight, user-friendly data-plane for LLM training.☆19Updated 2 months ago
- The driver for LMCache core to run in vLLM☆42Updated 4 months ago
- Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"☆18Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆48Updated 7 months ago