eth-nlped / mathdialLinks
๐งฎ MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
โ68Updated 2 months ago
Alternatives and similar repositories for mathdial
Users that are interested in mathdial are comparing it to the libraries listed below
Sorting:
- NAACL 2021: Are NLP Models really able to Solve Simple Math Word Problems?โ135Updated 3 years ago
- Awesome LLM for NLG Evaluation Papersโ25Updated last year
- โ189Updated 4 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)โ56Updated 2 months ago
- Token-level Reference-free Hallucination Detectionโ96Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Modelsโ119Updated 2 years ago
- paper list on reasoning in NLPโ194Updated 7 months ago
- RARR: Researching and Revising What Language Models Say, Using Language Modelsโ49Updated 2 years ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)โ64Updated last year
- Multilingual Large Language Models Evaluation Benchmarkโ133Updated last year
- โ50Updated 2 years ago
- โ116Updated last year
- โ22Updated 2 years ago
- Long Document Summarization Papersโ153Updated 2 years ago
- Codes for papers on Large Language Models Personalization (LaMP)โ176Updated 9 months ago
- โ85Updated 11 months ago
- โ50Updated 2 years ago
- ๐ฒ Code for our EMNLP 2023 paper - ๐ "Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Modeโฆโ52Updated last year
- โ88Updated 2 years ago
- Code and data for Marked Personas (ACL 2023)โ28Updated 2 years ago
- Code, datasets, and checkpoints for the paper "Improving Passage Retrieval with Zero-Shot Question Generation (EMNLP 2022)"โ100Updated 2 years ago
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomicโฆโ403Updated 7 months ago
- First explanation metric (diagnostic report) for text generation evaluationโ62Updated 8 months ago
- The official repo for SocKET: Social Knowledge Evaluation Testsโ24Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersโ135Updated last year
- [EMNLP 2022] TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Modelsโ74Updated last year
- This is a repository for sharing papers in the field of persona-based conversational AI. The related source code for each paper is linkedโฆโ168Updated last year
- โ78Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".โ164Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.โ165Updated 2 years ago