sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆27Updated last year
Alternatives and similar repositories for llm-rag-eval:
Users that are interested in llm-rag-eval are comparing it to the libraries listed below
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- LLM reads a paper and produce a working prototype☆55Updated last month
- ☆50Updated 5 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆79Updated 7 months ago
- ☆48Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- Code for ScribeAgent paper☆57Updated 2 months ago
- Automatic Prompt Optimization☆34Updated last year
- ☆41Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆52Updated 3 months ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆95Updated 6 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆110Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated last month
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- Measuring RAG solutions throughput and latency☆17Updated 9 months ago
- ☆77Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 8 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆53Updated last month
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343.☆81Updated 3 months ago
- Deep Research through Multi-Agents, using GraphRAG☆69Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆67Updated 5 months ago
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- ☆29Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆31Updated 2 months ago
- Track the progress of LLM context utilisation☆54Updated 3 weeks ago
- ☆18Updated 7 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments☆55Updated 2 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆30Updated 9 months ago