This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
☆56Aug 29, 2024Updated last year
Alternatives and similar repositories for tutoring-accuracy-dataset
Users that are interested in tutoring-accuracy-dataset are comparing it to the libraries listed below
Sorting:
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆75Sep 17, 2025Updated 5 months ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆111Apr 19, 2025Updated 10 months ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 9 months ago
- Can VLMs understand students' hand-drawn math work?☆16Jan 20, 2026Updated last month
- ☆24Jul 6, 2021Updated 4 years ago
- ☆11May 30, 2024Updated last year
- EduAgent: Generative Student Agents in Learning☆31Feb 14, 2026Updated 2 weeks ago
- The first high-quality, fine-grained error-correction conversation dataset between English second language learner and an educational c…☆15Aug 27, 2025Updated 6 months ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆18Mar 22, 2024Updated last year
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆26Updated this week
- EDUVSUM is a multimodal neural architecture that utilizes state-of-the-art audio, visual and textual features to identify important tempo…