☆47Jun 7, 2024Updated last year
Alternatives and similar repositories for VTC-artifact
Users that are interested in VTC-artifact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Apr 15, 2025Updated 11 months ago
- ☆28Updated this week
- ☆131Nov 11, 2024Updated last year
- ☆27Aug 31, 2023Updated 2 years ago
- ☆169Jul 15, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- SpotServe: Serving Generative Large Language Models on Preemptible Instances☆134Feb 22, 2024Updated 2 years ago
- Serverless Paper Reading and Discussion☆38Jan 9, 2023Updated 3 years ago
- Compression for Foundation Models☆35Jul 21, 2025Updated 8 months ago
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆30Mar 5, 2024Updated 2 years ago
- Efficient and easy multi-instance LLM serving☆536Mar 12, 2026Updated 2 weeks ago
- Managed collective communication service☆24Sep 2, 2024Updated last year
- A low-latency & high-throughput serving engine for LLMs☆486Jan 8, 2026Updated 2 months ago
- ☆14Sep 8, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- a distributed computation platform for running Python and Bash computation tasks on multiple nodes☆12Mar 19, 2025Updated last year
- A large-scale simulation framework for LLM inference☆564Jul 25, 2025Updated 8 months ago
- ☆10Mar 15, 2026Updated 2 weeks ago
- Code and data for "Impact of Evaluation Methodologies on Code Summarization" in ACL 2022.☆10Sep 6, 2022Updated 3 years ago
- Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …☆49Jun 1, 2024Updated last year
- Disaggregated serving system for Large Language Models (LLMs).☆792Apr 6, 2025Updated 11 months ago
- ☆14Apr 1, 2023Updated 2 years ago
- ☆176Mar 12, 2024Updated 2 years ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆31Mar 19, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- API2Vec: Learning Representations of API Sequences for Malware Detection☆15Mar 10, 2024Updated 2 years ago
- The first range filter to simultaneously offer dynamicity, fast operations, and a robust false positive rate for any workload.☆12Jul 15, 2025Updated 8 months ago
- QJump NS patches and driver scripts☆13Jun 29, 2015Updated 10 years ago
- ☆10Apr 29, 2023Updated 2 years ago
- ☆15Apr 11, 2024Updated last year
- ☆87Oct 17, 2025Updated 5 months ago
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- From-Classification-to-Clinical☆12Apr 26, 2024Updated last year
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆90Jun 16, 2025Updated 9 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆23Jan 6, 2026Updated 2 months ago
- ☆154Oct 9, 2024Updated last year
- Follow up to the DREBIN paper☆13Dec 27, 2018Updated 7 years ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- An Observability Framework for AI Training☆69Updated this week