Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable windows, converts each window's content into embeddings using Ollama, and stores the embeddings (along with the original text) in Elasticsearch.
☆25 Jun 7, 2025 Updated 10 months ago
Alternatives and similar repositories for stream-rag-agent
Users interested in stream-rag-agent are comparing it to the libraries listed below.
- One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder. Built for LLMs, RAG pipelines, and beyond. ☆66 Apr 8, 2026 Updated last week
- AI-powered text compression library for RAG systems and API calls. Reduce token usage by up to 50-60% while preserving semantic meaning w… ☆83 Aug 16, 2025 Updated 8 months ago
- This project implements a Reinforcement Learning (RL) enhanced Retrieval-Augmented Generation (RAG) system that optimizes document retrie… ☆24 Apr 27, 2025 Updated 11 months ago
- A VSCode extension for running LLM prompts. It turns VSCode into a powerful prompt IDE. ☆31 Oct 11, 2024 Updated last year
- pdfLLM is a completely open source, proof of concept RAG app. ☆186 Sep 1, 2025 Updated 7 months ago
- Recursive Self-Aggregation evals on ARC-AGI ☆29 Jan 26, 2026 Updated 2 months ago
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface ☆21 Nov 19, 2025 Updated 4 months ago
- Implementation of a fast semantic chunker in C++, installable in Python 3.7+ projects. ☆22 Sep 20, 2025 Updated 6 months ago
- ☆30 Apr 23, 2025 Updated 11 months ago
- Run GEPA on your favorite non-Python libraries. ☆34 Jan 22, 2026 Updated 2 months ago
- Using deep research workflow to generate datasets for finetuning LLMs. ☆39 Oct 9, 2025 Updated 6 months ago
- Dual-layer memory for AI agents. Compressed index + vector store. 91% recall, 70ms, fully local. ☆46 Updated this week
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark ☆22 Aug 22, 2025 Updated 7 months ago
- The RAG pipeline for optimizing dynamic data editing. ☆20 Oct 30, 2025 Updated 5 months ago
- Request distributor for web scraping ☆14 Jun 8, 2025 Updated 10 months ago
- CloudshipAI CLI ☆48 Dec 9, 2025 Updated 4 months ago
- A proof-of-concept demonstrating a custom-built host implementing an OpenAI-compatible API with Google Vertex AI, function calling, and i… ☆41 Oct 15, 2025 Updated 6 months ago
- The Go backend server of https://github.com/WarCluster/warcluster-client ☆10 Mar 17, 2016 Updated 10 years ago
- Code for training and using chess embedding models ☆13 Jun 9, 2024 Updated last year
- ☆10 Nov 14, 2025 Updated 5 months ago
- BlockRank makes LLMs efficient and scalable for RAG and in-context ranking ☆43 Dec 12, 2025 Updated 4 months ago
- AI library that makes interfacing with AI easier and provides tooling around AI ☆17 Apr 8, 2026 Updated last week
- An open-source AI agent that lives in your terminal. ☆35 Updated this week
- ☆16 Jun 4, 2025 Updated 10 months ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs… ☆47 Oct 2, 2024 Updated last year
- Code for my YouTube video on building a local AI assistant with whisper turbo 3 and llama 3.2 ☆16 Oct 21, 2024 Updated last year
- A markdown knowledgebase search tool combining semantic search, BM25 keyword matching, and knowledge graph traversal with reciprocal rank… ☆27 Jan 31, 2026 Updated 2 months ago
- This is a training method to produce a split brain model ☆14 Mar 7, 2025 Updated last year
- Web app built for sharing experiences through images ☆10 Apr 3, 2021 Updated 5 years ago
- A handy scaffolding tool for MCP servers ☆16 Nov 24, 2025 Updated 4 months ago
- Python utilities ☆18 May 25, 2025 Updated 10 months ago
- Go library for implementing the Model Context Protocol (MCP). ☆16 May 15, 2025 Updated 11 months ago
- HashIndex: LLM-optimized Document Indexing without vector search ☆43 Jan 24, 2026 Updated 2 months ago
- Implementation based on lightrag and nano-graphrag to connect with psql ☆15 Oct 28, 2024 Updated last year
- A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad ☆16 Sep 2, 2024 Updated last year
- Connect, secure, control, and observe services. ☆13 Mar 30, 2026 Updated 2 weeks ago
- Frona is a personal AI assistant. You create autonomous agents, give them tools, and talk to them through a chat interface. Agents act on… ☆74 Updated this week
- If you're using Burp Suite Community Edition and want to supercharge your workflow with some powerful AI assistance – without needing Bur… ☆42 Apr 16, 2025 Updated last year
- A powerful starter template for building undetectable web scrapers and browser automation bots. ☆57 May 5, 2025 Updated 11 months ago