☆301Dec 3, 2024Updated last year
Alternatives and similar repositories for financebench
Users that are interested in financebench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data and code for EMNLP 2021 paper "FinQA: A Dataset of Numerical Reasoning over Financial Data"☆367Jun 6, 2022Updated 3 years ago
- An OpenBB agent slack bot that is ready to answer any financial question☆12Feb 24, 2024Updated 2 years ago
- A package to parse SEC XBRL at scale.☆19Nov 25, 2025Updated 5 months ago
- ☆26Oct 23, 2025Updated 6 months ago
- StAtutory Reasoning Assessment☆17Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Research Artifact For Our Submission To VLDB☆11Oct 27, 2021Updated 4 years ago
- Data and code for EMNLP 2022 paper "ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering"☆120Nov 9, 2022Updated 3 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆241Dec 2, 2024Updated last year
- ☆15Oct 30, 2021Updated 4 years ago
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- ☆43Jul 10, 2024Updated last year
- ☆17May 14, 2025Updated 11 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…☆20Dec 7, 2022Updated 3 years ago
- Comprehensive benchmark for RAG☆283Jun 14, 2025Updated 10 months ago
- Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Docu…☆23Dec 21, 2024Updated last year
- LUNA: a Framework for Language Understanding and Naturalness Assessment.☆12Sep 9, 2023Updated 2 years ago
- The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice…☆511Jul 18, 2025Updated 9 months ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.☆2,162Oct 16, 2025Updated 6 months ago
- This repository contains related work, benchmarks and datasets for the paper "Large Language Models in Finance (FinLLMs)".☆365Apr 10, 2025Updated last year
- ☆14Oct 17, 2024Updated last year
- Outline to Story: Fine-grained Controllable Story Generation from Cascaded Events☆18Jun 16, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- BERT score for text generation☆12Jan 15, 2025Updated last year
- The FinEval financial domain evaluation benchmark, based on quantitative fundamental methods and developed through long-term objective re…☆270Jun 23, 2025Updated 10 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆270Mar 25, 2026Updated last month
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness"☆34May 3, 2023Updated 2 years ago
- Apify's reusable github workflows☆15Updated this week
- WallStr.Chat is an AI research assistant for investment bankers, hedge funds, and PE firms, enabling parallel chat with dozens of PDFs, w…☆17Feb 8, 2026Updated 2 months ago
- code associated with WANLI dataset in Liu et al., 2022☆30May 24, 2023Updated 2 years ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆41Sep 22, 2024Updated last year
- ☆19Mar 25, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code Repository for "A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language Models".☆15Oct 14, 2022Updated 3 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Jul 22, 2025Updated 9 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark☆22Aug 22, 2025Updated 8 months ago
- Scaling Agentic Environments Automatically.☆62Mar 26, 2026Updated last month
- ☆59Jun 7, 2024Updated last year
- Python library to access and analyze SEC Edgar filings, XBRL financial statements, 10-K, 10-Q, and 8-K reports☆2,067Updated this week
- VectorDB library using dispersion models. Provides graph analysis, vector search and a energy-distribution stats for your vectors in one …☆35Updated this week