sujitpal / llm-rag-evalLinks
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆27Updated last year
Alternatives and similar repositories for llm-rag-eval
Users that are interested in llm-rag-eval are comparing it to the libraries listed below
Sorting:
- LLM reads a paper and produce a working prototype☆57Updated last month
- ☆50Updated this week
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆97Updated 7 months ago
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- Verifiers for LLM Reinforcement Learning☆56Updated last month
- Simple examples using Argilla tools to build AI☆53Updated 6 months ago
- ☆49Updated 6 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆113Updated 3 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆24Updated 2 months ago
- ☆26Updated 2 months ago
- ☆67Updated 3 months ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆78Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆90Updated 2 months ago
- AIDE: the Machine Learning CodeGen Agent☆24Updated 7 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- ☆45Updated last year
- ☆24Updated 5 months ago
- ☆46Updated 8 months ago
- Deep Research through Multi-Agents, using GraphRAG☆71Updated 6 months ago
- ☆56Updated 6 months ago
- ☆29Updated 3 weeks ago
- Automatic Prompt Optimization☆36Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆81Updated 8 months ago
- ☆77Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week