This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
☆57Aug 29, 2024Updated last year
Alternatives and similar repositories for tutoring-accuracy-dataset
Users that are interested in tutoring-accuracy-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.☆35Feb 12, 2025Updated last year
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆84Sep 17, 2025Updated 8 months ago
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral☆39Dec 11, 2025Updated 6 months ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…☆12Jun 9, 2026Updated last week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Can VLMs understand students' hand-drawn math work?☆19Jan 20, 2026Updated 4 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆115Apr 19, 2025Updated last year
- EduAgent: Generative Student Agents in Learning☆35Feb 14, 2026Updated 4 months ago
- ☆11May 30, 2024Updated 2 years ago
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral☆36Nov 18, 2025Updated 6 months ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆19Mar 22, 2024Updated 2 years ago
- Code and data corresponding to "Hypothesis Only Baselines in Natural Language Inference" (StarSem 2018)☆25Dec 8, 2022Updated 3 years ago
- The first high-quality, fine-grained error-correction conversation dataset between English second language learner and an educational c…☆15Aug 27, 2025Updated 9 months ago
- ☆49Aug 6, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models☆405Jun 4, 2026Updated last week
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆12Aug 1, 2023Updated 2 years ago
- ☆13May 9, 2023Updated 3 years ago
- ACL style for Typst☆23Jan 27, 2026Updated 4 months ago
- An Annotated Question Answering Dataset for Assisting Chinese Python Programming Learners☆10Feb 23, 2024Updated 2 years ago
- This is part of the code used in my Computational Social Science doctoral seminar at Rugers Unviersity in 2023☆12Jul 7, 2023Updated 2 years ago
- The official implementation for the paper Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge.☆15Aug 14, 2023Updated 2 years ago
- ☆40Feb 4, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆26Nov 23, 2023Updated 2 years ago
- In-depth exploration of stable diffusion models, walking readers through the inner workings in a step-by-step manner.☆21Sep 8, 2025Updated 9 months ago
- Forecasting and Regression Analysis using textual data.☆15Sep 18, 2019Updated 6 years ago
- ☆30Apr 1, 2025Updated last year
- A small ETL for importing data from the common standards project into a relational database☆12Jul 17, 2018Updated 7 years ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆28Mar 2, 2026Updated 3 months ago
- ☆15Sep 20, 2025Updated 8 months ago
- Code for "Question Generation for Adaptive Education", to appear at ACL 2021.☆33Jul 18, 2021Updated 4 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12May 17, 2026Updated 3 weeks ago
- Official repository for the ICWSM '21 paper "More than meets the tie: Examining the Role of Interpersonal Relationships in Social Network…☆12Apr 26, 2023Updated 3 years ago
- SituLearner是综合影视字幕或音乐歌词的android单词学习软件, 它提供了英语单词或日语单词的情景化记忆手段。☆22May 18, 2026Updated 3 weeks ago
- Data from the online game Axon and code for for analysing it☆33Jan 1, 2014Updated 12 years ago
- ☆10Jun 1, 2024Updated 2 years ago
- A wiki platform for the students and teachers of Tsinghua University☆15May 10, 2026Updated last month
- Social Network Analysis and STEM Education is designed to prepare researchers to apply network analysis in order to better understand and…☆14Jul 14, 2025Updated 11 months ago