☆324 · Dec 3, 2024 · Updated last year
Alternatives and similar repositories for financebench
Users that are interested in financebench are comparing it to the libraries listed below.
- KITE (Knowledge-Intensive Task Evaluation) is an end-to-end benchmark for RAG pipelines ☆23 · Aug 14, 2024 · Updated last year
- Code for reconstructing full-text news articles from the GDELT Web News NGrams 3.0 dataset ☆32 · Apr 28, 2026 · Updated last month
- An OpenBB agent slack bot that is ready to answer any financial question ☆12 · Feb 24, 2024 · Updated 2 years ago
- StAtutory Reasoning Assessment ☆17 · Dec 8, 2022 · Updated 3 years ago
- Data and code for EMNLP 2022 paper "ConvFinQA: Exploring the Chain of Numerical Reasoning in Conversational Finance Question Answering" ☆125 · Nov 9, 2022 · Updated 3 years ago
- ☆15 · Mar 26, 2025 · Updated last year
- Python code examples for accessing and analyzing SEC's XBRL Data Sets ☆100 · Jan 21, 2026 · Updated 4 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆251 · Dec 2, 2024 · Updated last year
- An implementation of Vector Clock in Java ☆12 · Jun 27, 2025 · Updated 11 months ago
- Code for 'Contrastive Multi-Document Question Generation' ☆11 · Oct 16, 2022 · Updated 3 years ago
- ☆45 · Jul 10, 2024 · Updated last year
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks ☆12 · Sep 1, 2023 · Updated 2 years ago
- This is a work in progress package that enables users to conduct fundamental financial research, utilising the SEC's EDGAR API. ☆73 · May 6, 2026 · Updated last month
- ☆21 · Oct 22, 2021 · Updated 4 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details. ☆14 · Mar 20, 2024 · Updated 2 years ago
- Comprehensive benchmark for RAG ☆287 · Jun 14, 2025 · Updated 11 months ago
- Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Docu… ☆23 · Dec 21, 2024 · Updated last year
- The only open-source toolkit that can download SEC EDGAR financial reports and extract textual data from specific item sections into nice… ☆521 · Jul 18, 2025 · Updated 10 months ago
- This repository contains related work, benchmarks and datasets for the paper "Large Language Models in Finance (FinLLMs)". ☆371 · Apr 10, 2025 · Updated last year
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets. ☆2,211 · Oct 16, 2025 · Updated 7 months ago
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims ☆28 · May 30, 2023 · Updated 3 years ago
- ☆14 · Oct 17, 2024 · Updated last year
- This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning … ☆865 · Mar 4, 2025 · Updated last year
- DEREK (Domain Entities and Relations Extraction Kit) ☆10 · May 22, 2023 · Updated 3 years ago
- BERT score for text generation ☆12 · Jan 15, 2025 · Updated last year
- Not everyone can code, but everyone can learn. This Project is an AI powered DSA/Competitive Programming Helper with an inbuilt editor to… ☆14 · Jun 3, 2025 · Updated last year
- The FinEval financial domain evaluation benchmark, based on quantitative fundamental methods and developed through long-term objective re… ☆272 · Jun 23, 2025 · Updated 11 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆275 · Mar 25, 2026 · Updated 2 months ago
- Data for paper "Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness" ☆34 · May 3, 2023 · Updated 3 years ago
- WallStr.Chat is an AI research assistant for investment bankers, hedge funds, and PE firms, enabling parallel chat with dozens of PDFs, w… ☆17 · Feb 8, 2026 · Updated 4 months ago
- code associated with WANLI dataset in Liu et al., 2022 ☆30 · May 24, 2023 · Updated 3 years ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training ☆13 · Feb 15, 2025 · Updated last year
- First-place (top-1) solution to the Tianchi algorithm competition "BetterMixture - Large Model Data Mixing Challenge" ☆33 · Jul 7, 2024 · Updated last year
- This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling finan… ☆75 · Jun 23, 2025 · Updated 11 months ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues" ☆14 · Jul 22, 2025 · Updated 10 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark ☆22 · Aug 22, 2025 · Updated 9 months ago
- ☆59 · Jun 7, 2024 · Updated 2 years ago
- Vectors analytics and search library using dispersion models. Provides graph analysis, vector search and a energy-distribution stats for … ☆35 · May 19, 2026 · Updated 3 weeks ago
- Read and analyze SEC EDGAR filings in Python. 10-K, 8-K, XBRL financials, Form 3/4/5, 13F, ADV — clean API, well-typed, MIT-licensed. ☆2,286 · Jun 4, 2026 · Updated last week