Measuring RAG solutions throughput and latency
☆20Jul 23, 2024Updated last year
Alternatives and similar repositories for RAG-Performance
Users that are interested in RAG-Performance are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Feb 25, 2025Updated last year
- 一款基于FastGPT的微信机器人插件,提供智能知识库问答功能!💬☆10May 10, 2025Updated 11 months ago
- ☆22Oct 14, 2024Updated last year
- dow新协议接口☆21Jun 2, 2025Updated 10 months ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- markdown-image-manage for vscode☆13Jun 7, 2025Updated 10 months ago
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- Anki Flashcards from PDFs with AI☆12May 21, 2024Updated last year
- ☆21Oct 6, 2023Updated 2 years ago
- ☆15May 30, 2023Updated 2 years ago
- Code for the paper "Towards an Argument Mining Pipeline Transforming Texts to Argument Graphs" presented at COMMA 2020☆23Mar 25, 2025Updated last year
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆57Feb 4, 2026Updated 2 months ago
- computer study☆28Jan 25, 2026Updated 2 months ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- GraphRAG for Second Brain. Ingest knowledge -> build knowledge graphs -> Query relevant knowledge | Explore connections☆21Jun 5, 2025Updated 10 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆51Mar 25, 2026Updated 2 weeks ago
- ☆28Apr 10, 2025Updated last year
- Autonomous AI backend for deep research AI applications.☆58Mar 30, 2026Updated last week
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 3 years ago
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆27Sep 24, 2025Updated 6 months ago
- A repository of Juris-M style modules☆16Jan 17, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 2 years ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- Algofi protocol smart contracts☆23Sep 14, 2023Updated 2 years ago
- ☆10Aug 6, 2025Updated 8 months ago
- ☆10Jun 10, 2022Updated 3 years ago
- ☆17Aug 5, 2025Updated 8 months ago
- mem1是mem0的魔改版本。我的魔改能让它生成效果更可用和更适合做情感陪伴项目☆37Dec 4, 2024Updated last year
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆17Jul 21, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆30Jul 23, 2024Updated last year
- Official workloads used by OpenSearch Benchmark (OSB)☆30Updated this week
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆13Jan 12, 2026Updated 2 months ago
- JsonML-related tools for losslessly converting between XML/HTML and JSON, including mixed-mode XML. http://jsonml.org☆12Nov 6, 2019Updated 6 years ago
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑…☆17Jun 10, 2025Updated 10 months ago
- Miscellaneous codes and writings for MLOps☆15Updated this week