eth-nlped / mathdial
🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
☆74 · Updated 4 months ago
Alternatives and similar repositories for mathdial
Users who are interested in mathdial are comparing it to the repositories listed below.
- NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems? ☆139 · Updated 3 years ago
- ☆187 · Updated 7 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024) ☆59 · Updated 4 months ago
- Awesome LLM for NLG Evaluation Papers ☆25 · Updated 2 years ago
- The implementation of "RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question" [ACL 2023] ☆16 · Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linked… ☆169 · Updated last year
- ☆84 · Updated last week
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation. ☆35 · Updated last year
- ☆38 · Updated 2 years ago
- 🌲 Code for our EMNLP 2023 paper - "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Mode… ☆54 · Updated 2 years ago
- Token-level Reference-free Hallucination Detection ☆98 · Updated 2 years ago
- ☆47 · Updated 4 months ago
- ☆294 · Updated 2 years ago
- paper list on reasoning in NLP ☆195 · Updated 10 months ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation. ☆150 · Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions" ☆75 · Updated 3 years ago
- RARR: Researching and Revising What Language Models Say, Using Language Models ☆51 · Updated 2 years ago
- ☆88 · Updated 2 years ago
- A comprehensive paper list of Reasoning over Tables. ☆30 · Updated 3 years ago
- First explanation metric (diagnostic report) for text generation evaluation ☆62 · Updated 11 months ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)" ☆100 · Updated 3 years ago
- Multilingual Large Language Models Evaluation Benchmark ☆133 · Updated last year
- Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)" ☆62 · Updated 3 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆81 · Updated last year
- ☆83 · Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACL ☆108 · Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire" ☆258 · Updated 2 years ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022 ☆189 · Updated last year
- ☆117 · Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆154 · Updated 5 months ago