eth-nlped / mathdialLinks
๐งฎ MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
โ72Updated 3 months ago
Alternatives and similar repositories for mathdial
Users that are interested in mathdial are comparing it to the libraries listed below
Sorting:
- โ188Updated 6 months ago
- NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?โ136Updated 3 years ago
- paper list on reasoning in NLPโ193Updated 8 months ago
- โ83Updated 3 weeks ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"โ257Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Modelsโ119Updated 2 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.โ148Updated last year
- Token-level Reference-free Hallucination Detectionโ97Updated 2 years ago
- โ294Updated 2 years ago
- Awesome LLM for NLG Evaluation Papersโ25Updated last year
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generationโ214Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.โ153Updated 4 months ago
- ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Modelsโ47Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linkedโฆโ167Updated last year
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"โ81Updated last year
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)โ58Updated 3 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.โ154Updated 3 months ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicโฆโ414Updated 8 months ago
- RARR: Researching and Revising What Language Models Say, Using Language Modelsโ49Updated 2 years ago
- โ88Updated 2 years ago
- Codebase, data and models for the SummaC paper in TACLโ106Updated 11 months ago
- โ116Updated last year
- โ50Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.โ42Updated 2 years ago
- Repository for MuSiQue: Multi-hop Questions via Single-hop Question Composition, TACL 2022โ185Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"โ75Updated 3 years ago
- โ38Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmarkโ133Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.โ165Updated 2 years ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maskeโฆโ127Updated last year