sujitpal / llm-rag-eval
Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.
☆14 Updated 4 months ago
Related projects:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 2 months ago
- LLM reads a paper and produces a working prototype ☆19 Updated this week
- ☆30 Updated last week
- Automating enterprise workflows with multimodal agents ☆83 Updated last month
- Tutorial for DSPy ☆18 Updated 4 months ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll… ☆15 Updated 4 months ago
- ☆61 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆65 Updated 2 months ago
- ☆71 Updated 3 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆40 Updated 9 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆73 Updated 2 months ago
- Track the progress of LLM context utilisation ☆53 Updated 2 months ago
- ☆37 Updated last month
- Writing Blog Posts with Generative Feedback Loops! ☆41 Updated 6 months ago
- ☆18 Updated 2 months ago
- Demonstration of how to run multiple chains in LangChain asynchronously ☆12 Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆26 Updated 11 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from a PDF and QA on… ☆36 Updated 6 months ago
- Just a bunch of benchmark logs for different LLMs ☆112 Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆93 Updated 5 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆70 Updated last month
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆12 Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆96 Updated 10 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆49 Updated 3 weeks ago
- Tools to make language models a bit easier to use ☆22 Updated last week
- StructuredRAG Benchmarker ☆85 Updated last week
- RAG example using DSPy, Gradio, FastAPI ☆57 Updated 5 months ago
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆45 Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 6 months ago
- ☆27 Updated 6 months ago