yale-nlp / DocMath-EvalLinks
Data and Code for ACL 2024 paper "DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents"
☆23Updated 11 months ago
Alternatives and similar repositories for DocMath-Eval
Users that are interested in DocMath-Eval are comparing it to the libraries listed below
Sorting:
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆22Updated last year
- Repo for outstanding paper@ACL 2023 "Do PLMs Know and Understand Ontological Knowledge?"☆33Updated 2 years ago
- ☆57Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆82Updated 2 years ago
- [ACL 2023] Plug-and-Play Knowledge Injection for Pre-trained Language Models☆61Updated last year
- ☆33Updated last year
- [EMNLP 2023] C-STS: Conditional Semantic Textual Similarity☆73Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆50Updated 2 years ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆62Updated 2 years ago
- ☆32Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆54Updated last year
- [ACL 23] CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors☆38Updated 9 months ago
- self-adaptive in-context learning☆45Updated 2 years ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆51Updated 2 years ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Updated 2 years ago
- ☆26Updated last year
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆50Updated last year
- The code implementation of the EMNLP2022 paper: DisCup: Discriminator Cooperative Unlikelihood Prompt-tuning for Controllable Text Gene…☆27Updated 2 years ago
- [Findings of ACL'2023] Improving Contrastive Learning of Sentence Embeddings from AI Feedback☆40Updated 2 years ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)☆22Updated 2 years ago
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆83Updated 2 years ago
- First explanation metric (diagnostic report) for text generation evaluation☆62Updated 8 months ago
- This repository provides the data and the codes used in the AAAI'24 paper, COOPER: Coordinating Specialized Agents towards a Complex Dial…☆25Updated last year
- ☆88Updated 2 years ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Updated last year
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Updated last year
- Official repo for ACL 2023 paper Code4Struct: Code Generation for Few-Shot Structured Prediction from Natural Language.☆43Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆41Updated 2 years ago
- Understanding Factual Errors in Summarization: Errors, Summarizers, Datasets, Error Detectors (ACL 2023)☆28Updated last year
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated last month