lynnliu030 / artifact-evalLinks
☆8Updated 4 months ago
Alternatives and similar repositories for artifact-eval
Users that are interested in artifact-eval are comparing it to the libraries listed below
Sorting:
- ☆20Updated 3 months ago
- ☆12Updated 6 months ago
- Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.☆71Updated this week
- ☆79Updated 8 months ago
- Query-Adaptive Vector Search☆45Updated 2 months ago
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆22Updated last year
- Specification of the LDBC Financial Benchmark☆19Updated 10 months ago
- Modular and structured prompt caching for low-latency LLM inference☆98Updated 9 months ago
- ICDE 2023 Paper, GAR: A Generate-and-Rank Approach for Natural Language to SQL Translation☆19Updated last year
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆172Updated 10 months ago
- ☆16Updated 2 months ago
- ⚡ Faster similarity search with PDX: A vertical data layout for vectors☆51Updated this week
- [VLDB 25] Maximum Inner Product is Query-Scaled Nearest Neighbor☆29Updated 2 months ago
- MSVBASE is a system that efficiently supports complex queries of both approximate similarity search and relational operators. It integrat…☆97Updated 8 months ago
- Version of PBBS Benchmarks for VLDB 2024 Reviewers☆9Updated 2 years ago
- Code for MLSys 2024 Paper "SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models"☆20Updated last year
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆19Updated 5 months ago
- ☆22Updated this week
- ☆29Updated 5 months ago
- ☆47Updated last year
- LLM Serving Performance Evaluation Harness☆79Updated 5 months ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding☆123Updated 8 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆80Updated 5 months ago
- ☆40Updated 3 months ago
- Memory-Bounded GPU Acceleration for Vector Search☆27Updated 4 months ago
- A System for Optimized Semantic Computation☆127Updated this week
- ☆15Updated 2 months ago
- AskIt: Unified programming interface for programming with LLMs (GPT-3.5, GPT-4, Gemini, Claude, Cohere, Llama 2)☆79Updated 7 months ago
- ☆127Updated 3 weeks ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆39Updated 11 months ago