Measuring RAG solutions throughput and latency
☆19Jul 23, 2024Updated last year
Alternatives and similar repositories for RAG-Performance
Users who are interested in RAG-Performance are comparing it to the libraries listed below
- ☆11Aug 26, 2024Updated last year
- A structured framework for defining, verifying and certifying AI systems.☆17Mar 11, 2025Updated last year
- react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…☆173May 2, 2025Updated 10 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Feb 27, 2023Updated 3 years ago
- ☆15Feb 27, 2025Updated last year
- Anki Flashcards from PDFs with AI☆12May 21, 2024Updated last year
- Large Multimodal Model☆15Apr 8, 2024Updated last year
- ☆21Oct 6, 2023Updated 2 years ago
- ☆15May 30, 2023Updated 2 years ago
- ☆20Dec 13, 2024Updated last year
- GraphRAG for Second Brain. Ingest knowledge -> build knowledge graphs -> Query relevant knowledge | Explore connections☆21Jun 5, 2025Updated 9 months ago
- Autonomous AI backend for deep research AI applications.☆42Mar 12, 2026Updated last week
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆44Jan 23, 2026Updated last month
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
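- 🔥 Trending-topics aggregation and push platform: aggregates hot lists from 13+ platforms such as Weibo, Zhihu, and Bilibili, and supports push delivery through multiple channels including Telegram, Discord, and WeCom☆67Feb 28, 2026Updated 3 weeks ago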
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆27Sep 24, 2025Updated 5 months ago
- A repository of Juris-M style modules☆16Jan 17, 2024Updated 2 years ago
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4j☆18Aug 19, 2024Updated last year
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 2 years ago
- ☆17Aug 5, 2025Updated 7 months ago
- Deep learning-based multimodal integration of histology and genomics to improve cancer origin prediction☆28Mar 28, 2023Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Algofi protocol smart contracts☆23Sep 14, 2023Updated 2 years ago
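- mem1 is a heavily modified version of mem0. My modifications make its generated output more usable and better suited to emotional-companionship projects☆36Dec 4, 2024Updated last year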
- A plugin to use a language model to fill in parts of notes.☆16Feb 20, 2024Updated 2 years ago
- The Light Dark Matter eXperiment simulation and reconstruction framework.☆27Updated this week
- Official workloads used by OpenSearch Benchmark (OSB)☆30Mar 3, 2026Updated 2 weeks ago
- ☆26Sep 3, 2025Updated 6 months ago
- JsonML-related tools for losslessly converting between XML/HTML and JSON, including mixed-mode XML. http://jsonml.org☆12Nov 6, 2019Updated 6 years ago
- Miscellaneous codes and writings for MLOps☆15Updated this week
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated last year
- Create and analyze argument graphs and serialize them via Protobuf☆10Mar 13, 2026Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32May 29, 2024Updated last year
- ☆12Sep 24, 2025Updated 5 months ago
- ☆10Nov 17, 2024Updated last year
- A Model Context Protocol (MCP) server that provides JSON-RPC functionality through OpenRPC.☆43Mar 2, 2026Updated 2 weeks ago
- ☆16Jan 16, 2025Updated last year