Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆32Nov 18, 2025Updated 4 months ago
Alternatives and similar repositories for mathtutorbench
Users that are interested in mathtutorbench are comparing it to the libraries listed below
Sorting:
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral☆33Dec 11, 2025Updated 3 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆77Sep 17, 2025Updated 6 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆26Mar 2, 2026Updated 2 weeks ago
- ☆36Feb 4, 2026Updated last month
- ☆24Jul 6, 2021Updated 4 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- FlexEval is an LLM evaluation tool designed for practical quantitative analysis.☆16Sep 19, 2025Updated 6 months ago
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆13Apr 12, 2025Updated 11 months ago
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆30Dec 24, 2025Updated 2 months ago
- JIRA River Plugin for Elasticsearch☆24Apr 17, 2018Updated 7 years ago
- Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings☆13Mar 27, 2025Updated 11 months ago
- TreeInstruct is a novel method that uses state space estimation and dynamic tree-based questioning for multi-turn Socratic instruction, a…☆18Aug 1, 2025Updated 7 months ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- Askalot CQA System of Next Generation☆26Dec 14, 2022Updated 3 years ago
- Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.☆28Feb 12, 2025Updated last year
- development moved to https://github.com/myudelson/hmm-scalable☆38May 7, 2019Updated 6 years ago
- ☆20Apr 16, 2025Updated 11 months ago
- A simple Dataset generator for Moving Mnist☆14May 26, 2023Updated 2 years ago
- ☆10Jan 30, 2017Updated 9 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆32Dec 5, 2024Updated last year
- ☆15Jan 2, 2022Updated 4 years ago
- ☆23Aug 1, 2022Updated 3 years ago
- A collection of R packages for educational datamining☆15Jan 14, 2019Updated 7 years ago
- Because you're computing conversion rates wrong☆16May 23, 2017Updated 8 years ago
- ☆26Apr 18, 2020Updated 5 years ago
- [ICLR 2025] Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist☆35Oct 23, 2024Updated last year
- Ross extension to Chernoff faces☆11May 21, 2018Updated 7 years ago
- Code for spike coding networks☆16Jan 8, 2021Updated 5 years ago
- Data, code, and images for a posting summarizing three studies about pie charts☆15Jul 12, 2016Updated 9 years ago
- 사전에서 대화 예문만 추출한 데이터☆16Apr 24, 2023Updated 2 years ago
- CEduMEval : A Chinese educational multi-task evaluation benchmark☆17Nov 18, 2024Updated last year
- A web front end for R Twitter sentiment analysis☆18Sep 2, 2011Updated 14 years ago
- numpy实现常用的的机器学习库,分类模型实现:KNN,LDA,LR,Decision Tree(ID3,C4.5,CART),RF,perception,SVM,Neural network,GBDT,Xgboost,Adaboost;回归模型实现 :LASSO,Ridg…☆23Feb 19, 2022Updated 4 years ago
- Automatic Generation of Scaffolding Questions for Learning Math, EMNLP 2022. RL, REINFORCE☆25Jun 30, 2023Updated 2 years ago
- Remove unwanted LaTeX commands and their associated closing brackets☆11Jul 22, 2024Updated last year
- Race and Ethnicity based on name using data from census, voter reg. files, etc.☆11Jan 17, 2018Updated 8 years ago
- Command-line utility for fitting Hidden Markov Models at scale☆57Jul 21, 2022Updated 3 years ago