patronus-ai / financebench
☆136 Updated 2 months ago
Alternatives and similar repositories for financebench:
Users who are interested in financebench are comparing it to the libraries listed below:
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆101 Updated 10 months ago
- Comprehensive benchmark for RAG ☆114 Updated 3 months ago
- Knowledge Graph Retrieval Augmented Generation (KG-RAG) Eval Datasets ☆145 Updated 10 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆146 Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆145 Updated 10 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.… ☆231 Updated 4 months ago
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆129 Updated 4 months ago
- Repository for "MultiHop-RAG: A Dataset for Evaluating Retrieval-Augmented Generation Across Documents" (COLM 2024) ☆261 Updated 3 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, LangChain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆147 Updated 4 months ago
- GitHub repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆147 Updated 2 months ago
- This repository implements the chain of verification paper by Meta AI ☆163 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆198 Updated 3 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆404 Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆197 Updated 4 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents". ☆102 Updated 3 months ago
- This is the repo for the LegalBench-RAG Paper: https://arxiv.org/abs/2408.10343. ☆64 Updated last month
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆106 Updated last week
- Codebase accompanying the Summary of a Haystack paper. ☆74 Updated 5 months ago
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks. ☆244 Updated this week
- Sample notebooks and prompts for LLM evaluation ☆120 Updated 2 months ago
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆492 Updated 7 months ago
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆201 Updated 2 months ago
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆123 Updated 9 months ago
- Google DeepMind's PromptBreeder for automated prompt engineering implemented in LangChain Expression Language. ☆90 Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling) ☆324 Updated last month
- Build Enterprise RAG (Retrieval-Augmented Generation) Pipelines to tackle various Generative AI use cases with LLMs by simply plugging co… ☆109 Updated 6 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆220 Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆406 Updated last year
- Data and code for EMNLP 2022 paper "ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering" ☆89 Updated 2 years ago
- 🦜💯 Flex those feathers! ☆239 Updated 3 months ago