This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
☆56Aug 29, 2024Updated last year
Alternatives and similar repositories for tutoring-accuracy-dataset
Users that are interested in tutoring-accuracy-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆77Sep 17, 2025Updated 6 months ago
- ☆24Jul 6, 2021Updated 4 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆112Apr 19, 2025Updated 11 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral☆33Nov 18, 2025Updated 4 months ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆18Mar 22, 2024Updated 2 years ago
- ☆12Jan 25, 2024Updated 2 years ago
- pyKT: A Python Library to Benchmark Deep Learning based Knowledge Tracing Models☆380Jan 13, 2026Updated 2 months ago
- This is the website for the Language Technology and Data Analysis Laboratory (LADAL) which is part of the School of Languages and Culture…☆14Jan 29, 2025Updated last year
- R package that provides utilities for processing and analyzing the files that are exported from a recorded 'Zoom' Meeting. This includes …☆16Apr 5, 2022Updated 3 years ago
- Follow Me: Conversation Planning for Target-driven Recommendation Dialogue Systems☆11Aug 1, 2023Updated 2 years ago
- ☆13May 9, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆26Nov 23, 2023Updated 2 years ago
- ☆27Apr 8, 2025Updated 11 months ago
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆25May 29, 2024Updated last year
- ☆30Apr 1, 2025Updated 11 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆26Mar 2, 2026Updated 3 weeks ago
- ☆14Sep 20, 2025Updated 6 months ago
- ☆17Oct 31, 2023Updated 2 years ago
- ☆12Jan 16, 2022Updated 4 years ago
- 供大学生,竞赛生,高中生查找的math-wiki☆10May 26, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SituLearner是综合影视字幕或音乐歌词的android单词学习软件, 它提供了英语单词或日语单词的情景化记忆手段。☆17Dec 10, 2025Updated 3 months ago
- [WWW '25 Oral - GenMentor] Official code of our paper "LLM-powered Multi-agent Framework for Goal-oriented Learning in Intelligent Tutori…☆58Dec 3, 2025Updated 3 months ago
- Data from the online game Axon and code for for analysing it☆33Jan 1, 2014Updated 12 years ago
- A wiki platform for the students and teachers of Tsinghua University☆16Mar 17, 2026Updated last week
- Social Network Analysis and STEM Education is designed to prepare researchers to apply network analysis in order to better understand and…☆14Jul 14, 2025Updated 8 months ago
- 补充了一些Visualglm缺少的文件,可以对Visualglm进行训练,实例中是对人脸做了面相的识别☆13Jun 7, 2023Updated 2 years ago
- ☆19Jun 7, 2021Updated 4 years ago
- View presentation slides in Scrapbox☆16Jun 5, 2025Updated 9 months ago
- Using NLP to automatically score autobiographical interview narratives☆20Dec 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- a PhoneGap/Cordova plugin for OpenEars:☆10Apr 27, 2015Updated 10 years ago
- Egret Study Round☆11Nov 20, 2015Updated 10 years ago
- Source code of DisenHAN: Disentangled Heterogeneous Graph Attention Network for Recommendation, CIKM 2020☆14Mar 18, 2023Updated 3 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Deploy Meilisearch to render.com☆11Feb 8, 2023Updated 3 years ago
- Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.☆14Mar 24, 2023Updated 3 years ago
- Content Based Image Recognition Platform for Early Printed Materials☆11Mar 28, 2019Updated 6 years ago