Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆34Nov 18, 2025Updated 4 months ago
Alternatives and similar repositories for mathtutorbench
Users that are interested in mathtutorbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆78Sep 17, 2025Updated 6 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆27Mar 2, 2026Updated last month
- ☆36Feb 4, 2026Updated 2 months ago
- ☆24Jul 6, 2021Updated 4 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- FlexEval is an LLM evaluation tool designed for practical quantitative analysis.☆16Mar 26, 2026Updated 2 weeks ago
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆15Apr 12, 2025Updated 11 months ago
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p…☆57Aug 29, 2024Updated last year
- JIRA River Plugin for Elasticsearch☆24Apr 17, 2018Updated 7 years ago
- Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings☆13Mar 27, 2025Updated last year
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.☆31Feb 12, 2025Updated last year
- ☆20Apr 16, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A simple Dataset generator for Moving Mnist☆14May 26, 2023Updated 2 years ago
- ☆10Jan 30, 2017Updated 9 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- ☆23Mar 31, 2023Updated 3 years ago
- A collection of R packages for educational datamining☆15Jan 14, 2019Updated 7 years ago
- Because you're computing conversion rates wrong☆16May 23, 2017Updated 8 years ago
- ☆26Apr 18, 2020Updated 5 years ago
- [ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist☆35Oct 23, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Ross extension to Chernoff faces☆11May 21, 2018Updated 7 years ago
- Data, code, and images for a posting summarizing three studies about pie charts☆15Jul 12, 2016Updated 9 years ago
- Data on international first names and sex of people with that name☆13Jan 12, 2019Updated 7 years ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 7 months ago
- 사전에서 대화 예문만 추출한 데이터☆16Apr 24, 2023Updated 2 years ago
- CEduMEval : A Chinese educational multi-task evaluation benchmark☆17Nov 18, 2024Updated last year
- A web front end for R Twitter sentiment analysis☆18Sep 2, 2011Updated 14 years ago
- numpy实现常用的的机器学习库,分类模型实现:KNN,LDA,LR,Decision Tree(ID3,C4.5,CART),RF,perception,SVM,Neural network,GBDT,Xgboost,Adaboost;回归模型实现 :LASSO,Ridg…☆23Feb 19, 2022Updated 4 years ago
- Fitting stochastic blockmodels to graphs☆17Jul 8, 2016Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Command-line utility for fitting Hidden Markov Models at scale☆58Jul 21, 2022Updated 3 years ago
- A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying…☆17Dec 16, 2019Updated 6 years ago
- Unix stream tool using for Javascript and JSON☆16Feb 26, 2011Updated 15 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 10 months ago
- Compare geographic features☆14May 23, 2023Updated 2 years ago
- SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…☆19Dec 5, 2023Updated 2 years ago
- 1-day R workshop for experienced users. First session covers script writing, file names and overall project structures. The second sessio…☆13Jan 25, 2019Updated 7 years ago