yale-nlp/DocMath-Eval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yale-nlp/DocMath-Eval)

yale-nlp / DocMath-Eval

Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents"

☆23

Alternatives and similar repositories for DocMath-Eval

Users that are interested in DocMath-Eval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yale-nlp / QTSumm
View on GitHub
Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"
☆23Mar 29, 2024Updated 2 years ago
mjy1111 / PEAK
View on GitHub
The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models
☆16May 4, 2024Updated 2 years ago
rajdeep345 / ECTSum
View on GitHub
Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of…
☆34May 22, 2024Updated 2 years ago
gtfintechlab / FiNER
View on GitHub
☆16Sep 10, 2024Updated last year
yale-nlp / FinanceMath
View on GitHub
Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"
☆25Jul 14, 2026Updated last week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tianyi-lab / RuleR
View on GitHub
[NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling
☆14Sep 27, 2025Updated 9 months ago
bothe / dialogue-act-recognition
View on GitHub
Context-based Dialogue Act Recognition using Recurrent Neural Networks
☆13Nov 13, 2021Updated 4 years ago
majumderb / pabst
View on GitHub
Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021
☆10Jul 8, 2021Updated 5 years ago
wyu-du / Controlled-Dialogue-Generation
View on GitHub
This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…
☆12Dec 1, 2021Updated 4 years ago
tangzhy / RealCritic
View on GitHub
☆15Jan 27, 2025Updated last year
woonsangcho / contrast_qgen
View on GitHub
Code for 'Contrastive Multi-Document Question Generation'
☆11Oct 16, 2022Updated 3 years ago
psunlpgroup / MultiHiertt
View on GitHub
Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"
☆54Oct 22, 2024Updated last year
seanie12 / SWEP
View on GitHub
[ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA
☆16May 11, 2022Updated 4 years ago
yasumasaonoe / ecbd
View on GitHub
☆11Apr 23, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
Coldmist-Lu / MQM_APE
View on GitHub
[MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.
☆12Sep 24, 2024Updated last year
cgraywang / TextHIN
View on GitHub
A text-to-network representation and semantic parsing toolkit.
☆11Nov 11, 2019Updated 6 years ago
fanhan-inside / fanhan-inside.github.io
View on GitHub
Personal Blog
☆11May 12, 2026Updated 2 months ago
ytyz1307zzh / PLUG
View on GitHub
Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning"
☆13Aug 13, 2025Updated 11 months ago
jbshp / LongDocFACTScore
View on GitHub
☆10May 28, 2024Updated 2 years ago
TJYSunset / Strokes.txt
View on GitHub
汉字组件笔画数据
☆15Aug 14, 2018Updated 7 years ago
dair-iitd / FloNet
View on GitHub
Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs"
☆14Oct 10, 2022Updated 3 years ago
microsoft / clarification-qgen-globalinfo
View on GitHub
☆15Apr 29, 2021Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
neighthan / gpu-utils
View on GitHub
Utility functions/scripts for working with GPUs.
☆10Jul 5, 2021Updated 5 years ago
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago
microsoft / TraceCodegen
View on GitHub
☆27Jun 12, 2023Updated 3 years ago
yale-nlp / Bright-Pro
View on GitHub
Data and code for ACL 2026 Paper "Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems…
☆19Apr 30, 2026Updated 2 months ago
debatelab / aacorpus
View on GitHub
Code for the paper "Critical Thinking for Language Models"
☆13Jun 1, 2021Updated 5 years ago
AIRC-KETI / kowow
View on GitHub
This repository forked from parlAI. Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM…
☆16Dec 9, 2022Updated 3 years ago
vyomakesh09 / longagent
View on GitHub
LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration
☆11Mar 11, 2024Updated 2 years ago
Fu-Fu-Fu-Fu / VideoKR
View on GitHub
[ICML 26 Spotlight] Code for paper "VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding"
☆19Jun 5, 2026Updated last month
LZKSKY / CaSE_RISE
View on GitHub
This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".
☆10Jul 5, 2021Updated 5 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
BUPT-GAMMA / THIGE
View on GitHub
The source will be uploaded recently
☆14Aug 3, 2020Updated 5 years ago
rmsander / spatial_LDA
View on GitHub
This repository contains the implementation of an image-based LDA model for use in semi-automation of the image annotation and data curat…
☆16Jun 16, 2026Updated last month
zhaochaocs / DualEnc
View on GitHub
Codebase for DualEnc (ACL-20)
☆22Oct 3, 2023Updated 2 years ago
toriving / Plz_Read_The_Paper
View on GitHub
Paper reading logs
☆12Feb 26, 2022Updated 4 years ago
klimzaporojets / consistent-EL
View on GitHub
Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…
☆11Nov 13, 2022Updated 3 years ago
soummyaah / FinRED
View on GitHub
Dataset published in paper "FinRED: A Dataset for Relation Extraction in Financial Domain"
☆29Apr 15, 2022Updated 4 years ago
The-FinAI / The-FinData
View on GitHub
the benchmark for finance
☆11Jul 4, 2023Updated 3 years ago