Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆32 · Nov 18, 2025 · Updated 3 months ago
Alternatives and similar repositories for mathtutorbench
Users who are interested in mathtutorbench are comparing it to the repositories listed below.
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral ☆31 · Dec 11, 2025 · Updated 2 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ☆75 · Sep 17, 2025 · Updated 5 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors ☆26 · Dec 20, 2025 · Updated 2 months ago
- FlexEval is an LLM evaluation tool designed for practical quantitative analysis. ☆16 · Sep 19, 2025 · Updated 5 months ago
- ☆36 · Feb 4, 2026 · Updated 3 weeks ago
- NAACL 2024. Code & Dataset for "Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake… ☆45 · Jul 21, 2024 · Updated last year
- This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p… ☆56 · Aug 29, 2024 · Updated last year
- ☆24 · Jul 6, 2021 · Updated 4 years ago
- development moved to https://github.com/myudelson/hmm-scalable ☆38 · May 7, 2019 · Updated 6 years ago
- ☆10 · Jan 30, 2017 · Updated 9 years ago
- Exploring classifier-free guidance in a DDPM language model for text generation towards emotion targets. ☆11 · Sep 7, 2025 · Updated 5 months ago
- Because you're computing conversion rates wrong ☆16 · May 23, 2017 · Updated 8 years ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions. ☆17 · Aug 27, 2025 · Updated 6 months ago
- JIRA River Plugin for Elasticsearch ☆24 · Apr 17, 2018 · Updated 7 years ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs ☆29 · Dec 24, 2025 · Updated 2 months ago
- A web front end for R Twitter sentiment analysis ☆18 · Sep 2, 2011 · Updated 14 years ago
- Monotonic Attention based ConvBERT for Knowledge Tracing ☆15 · Sep 14, 2022 · Updated 3 years ago
- Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings ☆12 · Mar 27, 2025 · Updated 11 months ago
- An implementation of the Latent Skill Embedding model ☆10 · Feb 19, 2016 · Updated 10 years ago
- network visualization using plotly ☆12 · Sep 25, 2016 · Updated 9 years ago
- Unix stream tool using for Javascript and JSON ☆16 · Feb 26, 2011 · Updated 15 years ago
- Repository to create easy and straightforward slides in Manim together with Manim-slides ☆20 · Nov 26, 2025 · Updated 3 months ago
- Fitting stochastic blockmodels to graphs ☆17 · Jul 8, 2016 · Updated 9 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw ☆13 · May 27, 2022 · Updated 3 years ago
- Memory-optimized training scripts for video models based on Diffusers ☆14 · Jan 3, 2025 · Updated last year
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu… ☆13 · Apr 12, 2025 · Updated 10 months ago
- Toolbox for people detecting, tracking, and re-identifying. ☆18 · Feb 7, 2026 · Updated 3 weeks ago
- Command-line utility for fitting Hidden Markov Models at scale ☆57 · Jul 21, 2022 · Updated 3 years ago
- A Javascript library to conveniently add distribution builders to your online and offline experiments. ☆15 · May 25, 2023 · Updated 2 years ago
- Command-line utility for exporting OmniGraffle documents ☆25 · Oct 18, 2016 · Updated 9 years ago
- SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im… ☆18 · Dec 5, 2023 · Updated 2 years ago
- ☆15 · Jan 2, 2022 · Updated 4 years ago
- A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying… ☆17 · Dec 16, 2019 · Updated 6 years ago
- 2021 CIKM: source code for the Multi-Factors Aware Dual-Attentional Knowledge Tracing (MF-DAKT) ☆16 · Apr 25, 2023 · Updated 2 years ago
- [Korean-language description; text not recoverable] ☆16 · Apr 24, 2023 · Updated 2 years ago
- Multiple membership random effects. Wrapper around lme4::lmer and lme4::glmer. ☆22 · Oct 18, 2023 · Updated 2 years ago
- Beyond LM: How can language model go forward in the future? ☆15 · Apr 30, 2023 · Updated 2 years ago
- ☆15 · Dec 10, 2015 · Updated 10 years ago
- data center cooling with reinforcement learning ☆15 · Jun 3, 2020 · Updated 5 years ago