Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents"
☆23 · Updated Dec 21, 2024
Alternatives and similar repositories for DocMath-Eval
Users that are interested in DocMath-Eval are comparing it to the libraries listed below.
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data" ☆23 · Updated Mar 29, 2024
- Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of… ☆34 · Updated May 22, 2024
- ☆16 · Updated Sep 10, 2024
- [NAACL'25] RuleR: Improving LLM Controllability by Rule-based Data Recycling ☆14 · Updated Sep 27, 2025
- Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021 ☆10 · Updated Jul 8, 2021
- Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains" ☆24 · Updated Aug 10, 2024
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision… ☆32 · Updated Mar 12, 2024
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo… ☆12 · Updated Dec 1, 2021
- Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations" ☆15 · Updated Feb 8, 2024
- ☆11 · Updated Jul 23, 2023
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data" ☆54 · Updated Oct 22, 2024
- 🥇 A curated list of awesome large language models in finance (FinLLMs), including papers, models, datasets and codebases. A list of financial large models, especially Chinese-English bilingual large… ☆61 · Updated Apr 7, 2026
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA ☆16 · Updated May 11, 2022
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems ☆36 · Updated Nov 18, 2025
- Code for EACL 2023 paper "LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control" ☆20 · Updated Feb 7, 2023
- ☆26 · Updated Jun 2, 2026
- A text-to-network representation and semantic parsing toolkit. ☆11 · Updated Nov 11, 2019
- A version of ONScripter-EN designed to work with the Steam API, OpenGL, and other upgrades ☆12 · Updated Mar 28, 2018
- Unicode-only CJKV IDS data ☆13 · Updated Aug 9, 2024
- Demo of using WASM to sandbox Plotly execution ☆21 · Updated Mar 30, 2025
- Code for "End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs" ☆14 · Updated Oct 10, 2022
- Chinese character component and stroke data ☆15 · Updated Aug 14, 2018
- Chain of Images for Intuitively Reasoning ☆10 · Updated Nov 29, 2023
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration ☆11 · Updated Mar 11, 2024
- ☆15 · Updated Apr 29, 2021
- ☆23 · Updated Feb 26, 2024
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators. ☆11 · Updated Sep 24, 2024
- This repository is forked from ParlAI. A Korean Wizard of Wikipedia task was added to this repo. This repository is going to be moved after EM… ☆16 · Updated Dec 9, 2022
- ☆27 · Updated Jun 12, 2023
- Data and Code for EMNLP 2022 paper "ReasTAP: Injecting Table Reasoning Skills During Pre-training via Synthetic Reasoning Examples" ☆15 · Updated Jun 4, 2023
- Code for the paper "Critical Thinking for Language Models" ☆13 · Updated Jun 1, 2021
- This is the implementation of the paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance". ☆10 · Updated Jul 5, 2021
- Utility functions/scripts for working with GPUs. ☆10 · Updated Jul 5, 2021
- The source will be uploaded soon ☆14 · Updated Aug 3, 2020
- ACL 2026 & NAACL 2025: Bridging Retrieval and Inference through Evidence Fusion ☆13 · Updated Apr 9, 2026
- Encoder-decoders for translating different chemical formats. ☆21 · Updated Sep 17, 2025
- A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit ☆27 · Updated Dec 5, 2024
- Codebase for DualEnc (ACL-20) ☆22 · Updated Oct 3, 2023
- Paper reading logs ☆12 · Updated Feb 26, 2022