REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation
☆214Dec 26, 2025Updated 2 months ago
Alternatives and similar repositories for REFRAG
Users that are interested in REFRAG are comparing it to the libraries listed below
Sorting:
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆27Feb 13, 2026Updated 2 weeks ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search☆14Jun 18, 2025Updated 8 months ago
- Build Neo4J Knowledge Graphs from Excel files☆22Nov 18, 2024Updated last year
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting☆11Jan 9, 2023Updated 3 years ago
- ☆23Feb 22, 2026Updated last week
- Turn your Claude Code subscription to an OpenAI API compatible provider☆27Feb 20, 2026Updated last week
- ☆37Nov 14, 2025Updated 3 months ago
- ☆20Jun 16, 2025Updated 8 months ago
- ☆42Jan 6, 2025Updated last year
- Examples of fine-tuning LLMs☆19Oct 27, 2025Updated 4 months ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆37Sep 1, 2025Updated 6 months ago
- ☆16Apr 28, 2024Updated last year
- ☆23Nov 24, 2025Updated 3 months ago
- SQL and AI Workshop☆26Feb 13, 2025Updated last year
- 阿里云天池 - GLM 法律行业大模型挑战赛 - 我们小组实现基于大模型的对话机器人源码☆17Oct 23, 2024Updated last year
- Dive into LLM Agents☆18Jun 1, 2024Updated last year
- ☆21Updated this week
- Test Environment Booking tool☆14Nov 16, 2020Updated 5 years ago
- ☆11Updated this week
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆48Nov 10, 2025Updated 3 months ago
- Vector databases for generative AI☆22Apr 23, 2024Updated last year
- Ralph Loop based on Ryan's work for Claude Code | 基于Ryan版本实现的Ralph Loop,面向Claude Code☆44Jan 29, 2026Updated last month
- Control your computer with a voice interface☆29Nov 12, 2025Updated 3 months ago
- ☆19Nov 5, 2024Updated last year
- deepseek思维树模式实现☆22Jul 17, 2025Updated 7 months ago
- ☆52May 13, 2025Updated 9 months ago
- ☆28Oct 18, 2024Updated last year
- Language Model for Mainframe Modernization☆68Aug 23, 2024Updated last year
- OpenAI WebRTC example app: Realtime API voice chat app, built with React/Next.js☆32Jan 3, 2025Updated last year
- 由中国政法大学和北京航空航天大学共同设计,基于GLM-9B的法律文书处理和判决预测模型☆29Sep 6, 2024Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆32Jan 11, 2025Updated last year
- Dockerized LLM inference server with constrained output (JSON mode), built on top of vLLM and outlines. Faster, cheaper and without rate …☆27Feb 17, 2024Updated 2 years ago
- Includes examples on how to evaluate LLMs☆23Nov 4, 2024Updated last year
- Agent Sandbox is an E2B compatible, enterprise-grade ai-first, cloud-native runtime environment for AI Agents. Allows Agents to securely …☆66Jan 30, 2026Updated last month
- This is system where images are trained and recognize of bumch of faces at a time☆23Oct 25, 2025Updated 4 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Aug 24, 2024Updated last year
- ☆28Apr 28, 2024Updated last year
- AI-powered draw.io diagram generator for Claude Code. Generate flowcharts, architecture diagrams, mind maps from natural language with br…☆54Jan 13, 2026Updated last month