Measuring RAG solutions throughput and latency
☆20Jul 23, 2024Updated last year
Alternatives and similar repositories for RAG-Performance
Users that are interested in RAG-Performance are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Feb 25, 2025Updated last year
- ☆20Apr 14, 2026Updated 2 weeks ago
- dow新协议接口☆21Jun 2, 2025Updated 10 months ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- ☆15Feb 27, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Large Multimodal Model☆15Apr 8, 2024Updated 2 years ago
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆31Feb 14, 2026Updated 2 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 5 months ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 3 months ago
- computer study☆28Apr 23, 2026Updated last week
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆59Feb 4, 2026Updated 2 months ago
- GraphRAG for Second Brain. Ingest knowledge -> build knowledge graphs -> Query relevant knowledge | Explore connections☆25Jun 5, 2025Updated 10 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- 🔥 热点聚合推送平台 - 聚合微博、知乎、B站等 13+ 平台热榜,支持 Telegram、Discord、企业微信等多渠道推送☆68Feb 28, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An implementation of MSSRM method☆10Mar 23, 2023Updated 3 years ago
- A highly contextualized retrieval system integrating Large Language Models (LLMs), embeddings, and a dynamic agent-driven framework. Supp…☆27Sep 24, 2025Updated 7 months ago
- Get 笔记 openclaw Skill☆95Updated this week
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆42Apr 29, 2023Updated 3 years ago
- 📚【更新中】AI-Driven Enterprise Security: Architecture, Methodology, and Practice:AI驱动的企业安全建设实战,覆盖安全架构设计、方法论框架与工程实践,系统化提出 AISecOps 方法论框架,将 AI…☆92Jan 31, 2026Updated 3 months ago
- Open-source and self-hostable wallet key management. Create non-custodial wallets for users in Ethereum and Solana☆80Apr 16, 2026Updated 2 weeks ago
- DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。☆11Jan 9, 2024Updated 2 years ago
- Exploring advanced prompting tools to query SQL database with multiple tables in natural language using LLMs☆16Aug 23, 2024Updated last year
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A python client library for interacting with a Drools KIE SERVER☆32Jan 23, 2023Updated 3 years ago
- ☆10Jun 10, 2022Updated 3 years ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆18Jul 21, 2025Updated 9 months ago
- Examples in the MLX framework☆11Sep 23, 2024Updated last year
- LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)☆30Jul 23, 2024Updated last year
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆13Jan 12, 2026Updated 3 months ago
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑…☆17Jun 10, 2025Updated 10 months ago
- Miscellaneous codes and writings for MLOps☆15Apr 8, 2026Updated 3 weeks ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Create and analyze argument graphs and serialize them via Protobuf☆10Apr 19, 2026Updated last week
- ☆10Nov 17, 2024Updated last year
- ☆15Oct 9, 2024Updated last year
- Flexible and transparent Python Boruta implementation☆15Jun 8, 2025Updated 10 months ago
- The API extractor for npm packages powering jsDocs.io☆15Apr 2, 2026Updated 3 weeks ago
- Building a Legal Case Search Engine Using Qdrant, Llama 3, LangChain and Exploring Different Filtering Techniques☆16Jul 6, 2024Updated last year
- data about OAB Exams☆12Oct 1, 2018Updated 7 years ago