This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augmented Generation (RAG) using various LLMs.
☆32Dec 30, 2024Updated last year
Alternatives and similar repositories for CAG-Cache-Augmented-Generation
Users that are interested in CAG-Cache-Augmented-Generation are comparing it to the libraries listed below.
- A Demo of Cache-Augmented Generation (CAG) in an LLM☆123Jun 10, 2025Updated 10 months ago
- Comprehensive-Guide-for-Small-Language-Model-Development☆25Oct 20, 2024Updated last year
- An agent for medical reasoning☆10Oct 5, 2024Updated last year
- Designing a RAG pipeline using Gemma-2b, DSPy, and Qdrant☆10Mar 19, 2024Updated 2 years ago
- Exploring retrieval systems for language models☆14Apr 12, 2025Updated last year
- Secure agents for real-world tasks☆54Apr 22, 2026Updated 2 weeks ago
- Utilize Autonomous AI Agents in a Project Management Office (PMO)☆28Apr 24, 2024Updated 2 years ago
- The official GitHub page for the survey paper "A Survey on LLM Symbolic Reasoning", which is under review.☆32Mar 23, 2026Updated last month
- Sequential planner for large text based environments☆12Dec 13, 2023Updated 2 years ago
- This repository explores CAG and its integration with Granite language models, demonstrating how Granite’s extended context windows and p…☆13Sep 18, 2025Updated 7 months ago
- Streamlit Textcomplete - Autocomplete text in any textarea (HTMLTextAreaElement)☆17Oct 16, 2024Updated last year
- Self-hosted orchestration layer for autonomous AI agent teams. Shared memory, heartbeat scheduling, vault-first secrets, and cross-model …☆64Updated this week
- Super performant RAG pipeline for AI apps.☆17Mar 10, 2024Updated 2 years ago
- Applied Reinforcement Learning course☆14Feb 14, 2023Updated 3 years ago
- A barely barebone NumPy implementation of Hierarchical Temporal Memory.☆11Mar 26, 2023Updated 3 years ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆63Dec 26, 2025Updated 4 months ago
- AI Travel Planner App Tutorial from TubeGuruji.☆10Mar 15, 2025Updated last year
- Just an UI for Chatterbox, which uses about 1-2 GB RAM. Double click and you're good to go.☆20Jun 3, 2025Updated 11 months ago
- White Cats define Pure functions☆16Nov 4, 2025Updated 6 months ago
- A chrome extension for tinder to reply using GPT with a personality.☆10Jan 4, 2024Updated 2 years ago
- Colab Notebook for SeamlessM4T model by Meta☆10Aug 23, 2023Updated 2 years ago
- GPT based autonomous agent that does online comprehensive research on any given topic☆13Aug 29, 2023Updated 2 years ago
- ☆13Jun 21, 2021Updated 4 years ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG☆1,486May 26, 2025Updated 11 months ago
- A minimal Model Context Protocol 🖥️ server/client🧑💻with OpenAI and 🌐 web browser control via Playwright.☆33Apr 3, 2026Updated last month
- (Silver medal - 60th place - Top 3%) Repository for the "Tweet Sentiment Extraction" Kaggle competition.☆11Jun 18, 2020Updated 5 years ago
- A fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Open-Source Intelligent Command Layer☆92Updated this week
- Simulating Realistic Human Scanpaths in Dynamic Real-World Scenes☆15Mar 3, 2026Updated 2 months ago
- ☆12Oct 11, 2024Updated last year
- ☆28Aug 26, 2025Updated 8 months ago
- Hierarchical Framework for Interpretable Deep Reinforcement Learning Based- Predictive Maintenance (Applied to NASA Turbofan engine datas…☆14Feb 9, 2024Updated 2 years ago
- This is a simple calendar application that allows users to add, edit, and delete events.☆10Jan 28, 2025Updated last year
- A hover zoom effect to see a closer view of the image details.☆11Jan 13, 2025Updated last year
- [ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting☆17Jun 7, 2022Updated 3 years ago
- Energy Based Models are a quite novel technique for density estimation. In this university project I explore this new research topic and …☆15Jul 6, 2021Updated 4 years ago
- qgpt-issue-31☆11Oct 31, 2024Updated last year
- ☆18Nov 29, 2024Updated last year
- AI Research Agent is a versatile application that leverages multiple tools to conduct thorough research on any topic.☆12Oct 12, 2024Updated last year