patronus-ai / financebench
☆126Updated last month
Alternatives and similar repositories for financebench:
Users that are interested in financebench are comparing it to the libraries listed below
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆101Updated 9 months ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets☆141Updated 9 months ago
- This repository implements the chain of verification paper by Meta AI☆160Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆141Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆106Updated 3 weeks ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆132Updated last month
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024)☆242Updated last month
- 🦜💯 Flex those feathers!☆236Updated 2 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆145Updated 9 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆139Updated 3 months ago
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generaton☆187Updated 9 months ago
- Preprocessing pipeline notebooks and API supporting text extraction from SEC documents☆142Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆388Updated 2 weeks ago
- ☆137Updated 5 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆200Updated last month
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.…☆204Updated 3 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆119Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆97Updated 4 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆190Updated 2 months ago
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆117Updated 3 months ago
- Sample notebooks and prompts for LLM evaluation☆119Updated last month
- awesome synthetic (text) datasets☆253Updated 2 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆307Updated 3 weeks ago
- Automated Evaluation of RAG Systems☆526Updated 2 months ago
- ☆62Updated 5 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆222Updated last week
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language.☆87Updated 5 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆102Updated 2 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆115Updated last week