Reliable and Efficient Semantic Prompt Caching with vCache
☆60Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for vCache
Users that are interested in vCache are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Classic Chess game using x86 Assembly Language☆11Apr 23, 2019Updated 6 years ago
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 7 months ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated 2 months ago
- A supervised fine-tuning method for controllable reasoning length in large language models (一种通过有监督微调实现大语言模型思考长度可控的方法)☆10May 8, 2025Updated 10 months ago
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆32Feb 27, 2026Updated 3 weeks ago
- Example to read qr code with kotlin☆10Jul 24, 2018Updated 7 years ago
- About my PC setup and my scripts to automate workstation and server setup after a fresh OS install.☆16Dec 18, 2025Updated 3 months ago
- A four-dimensional Analysis of Partitioned Approximate Filters☆11Aug 6, 2025Updated 7 months ago
- Tools for Natural Language Processing☆12Feb 16, 2018Updated 8 years ago
- BinDex: A Two-Layered Index for Fast and Robust Scans (SIGMOD2020)☆10Jun 5, 2020Updated 5 years ago
- ☆19Jul 14, 2025Updated 8 months ago
- A Simple Algorithm for Minimum Cuts in Near-Linear Time (SWAT '20)☆12Apr 24, 2020Updated 5 years ago
- ☆21Jan 16, 2025Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆23May 28, 2025Updated 9 months ago
- Official Repository for the ICLR 2022 paper "Generalization of Neural Combinatorial Solvers through the Lens of Adversarial Robustness"☆13Nov 20, 2022Updated 3 years ago
- Workflow Defined Engine☆25Nov 4, 2025Updated 4 months ago
- C++ Implementation of Zip Trees☆14Nov 5, 2022Updated 3 years ago
- A repository containing deep learning models and evaluation methods for enhancing medical image segmentation in Computed Tomography (CT) …☆20Jan 20, 2024Updated 2 years ago
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆18Nov 19, 2022Updated 3 years ago
- Reproducibility package for "Robust Join Processing with Diamond Hardened Joins"☆12Jul 10, 2024Updated last year
- Offical Repository of MetaAgent Program☆43Dec 2, 2025Updated 3 months ago
- Implementation of Google Dremel's storage engine in a custom in-memory DB with query compilation.☆14Oct 10, 2020Updated 5 years ago
- Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"☆17Aug 4, 2020Updated 5 years ago
- Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."☆23Dec 23, 2024Updated last year
- Implementation of the DeepSqueeze paper: https://cs.brown.edu/people/acrotty/pubs/3318464.3389734.pdf☆12Oct 14, 2021Updated 4 years ago
- Implementation of the Generic Cell Rate Algorithm in C as a Redis Module☆19Jul 15, 2022Updated 3 years ago
- A tiny Flask app to provide access to Redis through a web form.☆17Jan 12, 2026Updated 2 months ago
- An implementation of the persistent skiplist based on Intel Optane Persistent Memory. It is with Intel's pmemkv as an storage engine☆13Apr 9, 2021Updated 4 years ago
- Code line highlighting for LaTeX with lstlisting (for beamer)☆15Nov 11, 2014Updated 11 years ago
- ☆18Sep 22, 2024Updated last year
- ☆13Jun 24, 2025Updated 8 months ago
- The Randomized Dependence Coefficient in Python☆20May 12, 2019Updated 6 years ago
- Redis module for secure password storage☆23May 18, 2016Updated 9 years ago
- Prize-Collecting Traveling Salesman Problem with Time Windows☆15Nov 8, 2020Updated 5 years ago
- learned cardinalities for databases☆16Apr 12, 2023Updated 2 years ago
- Implementation of an AVL tree in Java☆21Oct 1, 2020Updated 5 years ago
- Reproducibility package for "Two Birds With One Stone: Designing a Hybrid Cloud Storage Engine for HTAP"☆25Jul 10, 2024Updated last year
- A Comprehensive Survey on Long Context Language Modeling☆235Nov 24, 2025Updated 4 months ago
- Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data☆58Feb 5, 2026Updated last month