A Demo of Cache-Augmented Generation (CAG) in an LLM
☆123Jun 10, 2025Updated 8 months ago
Alternatives and similar repositories for cacheaugmentedgeneration
Users who are interested in cacheaugmentedgeneration are comparing it to the libraries listed below.
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,471May 26, 2025Updated 9 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- Exploring retrieval systems for language models☆14Apr 12, 2025Updated 10 months ago
- Generative AI in realtime with Confluent Cloud.☆28Apr 16, 2024Updated last year
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking☆42Dec 12, 2025Updated 2 months ago
- This repository demonstrates various examples using YOLO☆13Feb 9, 2024Updated 2 years ago
- ☆15Feb 1, 2025Updated last year
- ☆45Feb 13, 2026Updated 3 weeks ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- ☆16Mar 23, 2025Updated 11 months ago
- ☆43Feb 11, 2025Updated last year
- Deploying full-stack on-prem deep research agent that can be run entirely on a local machine for $0!☆33Nov 8, 2025Updated 3 months ago
- An agent to generate stunning images :)☆23May 22, 2025Updated 9 months ago
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,…☆54Mar 22, 2025Updated 11 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 9 months ago
- Retrieval-augmented generation (RAG) for remote & local LLM use☆44May 24, 2025Updated 9 months ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 4 months ago
- MCP server for integrating OpenAI's Deep Research APIs and Hugging Face's Open Deep Research with Claude Code and other AI assistants☆45Feb 10, 2026Updated 3 weeks ago
- Implementation of the paper "Improving Multi-step RAG with Hypergraph-based Memory for Long-context Complex Relational Modeling"☆115Jan 22, 2026Updated last month
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆56Feb 11, 2026Updated 3 weeks ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included.☆25Jan 14, 2026Updated last month
- The Journey of RAG: From Notebook to Microservices☆26Feb 22, 2024Updated 2 years ago
- Custom hooks for pi coding agent☆56Feb 23, 2026Updated last week
- TheNZT is a powerful multi-agent finance query processing system designed to process and respond to finance-related queries efficiently. …☆30Feb 3, 2026Updated last month
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- ☆23Oct 28, 2024Updated last year
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆64Dec 10, 2025Updated 2 months ago
- RapidFire AI: Rapid AI Customization from RAG to Fine-Tuning☆141Updated this week
- ☆33Jan 30, 2025Updated last year
- Production-ready Python library for multi-provider LLM orchestration☆40Oct 10, 2025Updated 4 months ago
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- ☆50Jun 18, 2025Updated 8 months ago
- GRACE (Graph-RAG Anchored Code Engineering): open Agent Skills for contract-driven AI code generation with semantic markup, knowledge gr…☆57Updated this week
- ☆36Feb 28, 2026Updated last week
- This provides a Zero-Server Web Interface for use with Ollama local LLMs and provides AI search via Perplexity (API Key required) and ima…☆33Feb 9, 2026Updated 3 weeks ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- ☆30Apr 23, 2025Updated 10 months ago
- This repository contains an implementation of the 3D watermarking algorithm proposed by Cayre et al based on Spectral Decomposition.☆11Jun 3, 2018Updated 7 years ago