Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents"
☆23Dec 21, 2024Updated last year
Alternatives and similar repositories for DocMath-Eval
Users who are interested in DocMath-Eval are comparing it to the libraries listed below
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆22Mar 29, 2024Updated last year
- Data and Code for the paper "FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains"☆24Aug 10, 2024Updated last year
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 3 months ago
- ☆27Jun 12, 2023Updated 2 years ago
- Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of…☆32May 22, 2024Updated last year
- This repo develops reasoning models for the financial domain, aiming to enhance models' capabilities in handling finan…☆71Jun 23, 2025Updated 8 months ago
- This is the official implementation of the paper "MM-SHAP: A Performance-agnostic Metric for Measuring Multimodal Contributions in Vision…☆32Mar 12, 2024Updated last year
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- o1 Chain of Thought Examples☆33Oct 4, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- Concurrency library☆17Oct 13, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- ☆25Sep 1, 2025Updated 6 months ago
- CANdle - a library for using USB-FDCAN dongle and communicating with md80 drives☆15Sep 15, 2025Updated 5 months ago
- [MQM-APE] Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators.☆11Sep 24, 2024Updated last year
- Develop C++/CUDA extensions with PyTorch like Python scripts☆10Jan 7, 2026Updated last month
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- ☆10May 28, 2024Updated last year
- ☆10Apr 7, 2024Updated last year
- An active inference model of Lacanian psychoanalysis☆15Jun 7, 2025Updated 8 months ago
- [AAAI2024] An official PyTorch implementation of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Python Inference Script(PyIS)☆19Aug 30, 2022Updated 3 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆11Nov 27, 2022Updated 3 years ago
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year
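- Western scholars generally approach Chinese characters through their components; this repository provides a detailed description and a database of Chinese character component decomposition.☆11Jul 20, 2023Updated 2 years ago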
- ☆50Jun 7, 2025Updated 8 months ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆52Oct 22, 2024Updated last year
- Code for "Unsupervised Enrichment of Persona-grounded Dialog with Background Stories", ACL 2021☆10Jul 8, 2021Updated 4 years ago
- [NeurIPS 2024] CHASE: Learning Convex Hull Adaptive Shift for Skeleton-based Multi-Entity Action Recognition☆16Nov 12, 2025Updated 3 months ago
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated last year
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Memory experiments with LLMs☆11Mar 31, 2023Updated 2 years ago
- ☆11Apr 6, 2024Updated last year
- Personal Blog☆11Updated this week
- JSON RPC v2.0 Sans I/O☆11Updated this week
- f-PO: Generalizing Preference Optimization with f-divergence Minimization☆13Apr 2, 2025Updated 11 months ago
- A text-to-network representation and semantic parsing toolkit.☆11Nov 11, 2019Updated 6 years ago
- This library implements functions and classes for mesh registration, data augmentation, and data normalisation.☆11Oct 7, 2024Updated last year
- Bridging Retrieval and Inference through Evidence Fusion☆12Oct 20, 2025Updated 4 months ago