eth-lre/mathtutorbench

Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral

☆32

Alternatives and similar repositories for mathtutorbench

Users who are interested in mathtutorbench are comparing it to the repositories listed below.

eth-lre / PedagogicalRL
Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral
☆31 · Dec 11, 2025 · Updated 2 months ago
eth-nlped / mathdial
🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
☆75 · Sep 17, 2025 · Updated 5 months ago
kaushal0494 / UnifyingAITutorEvaluation
An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
☆26 · Dec 20, 2025 · Updated 2 months ago
DigitalHarborFoundation / FlexEval
FlexEval is an LLM evaluation tool designed for practical quantitative analysis.
☆16 · Sep 19, 2025 · Updated 5 months ago
ddemszky / classroom-transcript-analysis
☆36 · Feb 4, 2026 · Updated 3 weeks ago
rosewang2008 / bridge
NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…
☆45 · Jul 21, 2024 · Updated last year
Khan / tutoring-accuracy-dataset
This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p…
☆56 · Aug 29, 2024 · Updated last year
kstats / CIMA
☆24 · Jul 6, 2021 · Updated 4 years ago
IEDMS / standard-bkt
development moved to https://github.com/myudelson/hmm-scalable
☆38 · May 7, 2019 · Updated 6 years ago
msr-ds3 / nyctaxi
☆10 · Jan 30, 2017 · Updated 9 years ago
vvhg1 / guided-text-generation-with-classifier-free-language-diffusion
Exploring classifier-free guidance in a DDPM language model for text generation towards emotion targets.
☆11 · Sep 7, 2025 · Updated 5 months ago
erikbern / conversion
Because you're computing conversion rates wrong
☆16 · May 23, 2017 · Updated 8 years ago
ycpNotFound / GeoGen
A pipeline for the automatic construction of geometry problems along with step-by-step solutions.
☆17 · Aug 27, 2025 · Updated 6 months ago
searchisko / elasticsearch-river-jira
JIRA River Plugin for Elasticsearch
☆24 · Apr 17, 2018 · Updated 7 years ago
inclusionAI / MoBE
Mixture-of-Basis-Experts for Compressing MoE-based LLMs
☆29 · Dec 24, 2025 · Updated 2 months ago
redmonk / bluebird
A web front end for R Twitter sentiment analysis
☆18 · Sep 2, 2011 · Updated 14 years ago
codingchild2424 / MonaCoBERT
Monotonic Attention based ConvBERT for Knowledge Tracing
☆15 · Sep 14, 2022 · Updated 3 years ago
eth-lre / book2dial
Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings
☆12 · Mar 27, 2025 · Updated 11 months ago
Knewton / lentil
An implementation of the Latent Skill Embedding model
☆10 · Feb 19, 2016 · Updated 10 years ago
dgrapov / networkly
network visualization using plotly
☆12 · Sep 25, 2016 · Updated 9 years ago
jhs / jss
Unix stream tool for JavaScript and JSON
☆16 · Feb 26, 2011 · Updated 15 years ago
PanoPepino / beanim
Repository to create easy and straightforward slides in Manim together with Manim-slides
☆20 · Nov 26, 2025 · Updated 3 months ago
ntamas / blockmodel
Fitting stochastic blockmodels to graphs
☆17 · Jul 8, 2016 · Updated 9 years ago
xbmxb / StructureCharacterization4DD
https://openreview.net/forum?id=OC1o4_OI6Jw
☆13 · May 27, 2022 · Updated 3 years ago
lucataco / cog-hunyuanvideo-lora-trainer
Memory-optimized training scripts for video models based on Diffusers
☆14 · Jan 3, 2025 · Updated last year
RamonKaspar / MathPrompter
MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…
☆13 · Apr 12, 2025 · Updated 10 months ago
rathaumons / pyppbox
Toolbox for people detecting, tracking, and re-identifying.
☆18 · Feb 7, 2026 · Updated 3 weeks ago
myudelson / hmm-scalable
Command-line utility for fitting Hidden Markov Models at scale
☆57 · Jul 21, 2022 · Updated 3 years ago
QuentinAndre / DistributionBuilder
A Javascript library to conveniently add distribution builders to your online and offline experiments.
☆15 · May 25, 2023 · Updated 2 years ago
dcreager / graffle-export
Command-line utility for exporting OmniGraffle documents
☆25 · Oct 18, 2016 · Updated 9 years ago
SCUT-DLVCLab / SCUT-EnsExam
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…
☆18 · Dec 5, 2023 · Updated 2 years ago
arghosh / BOBCAT
☆15 · Jan 2, 2022 · Updated 4 years ago
amit-sharma / splitdoor-causal-criterion
A method for estimating causal effects in time-series data. Uses available data to automatically find natural experiments for identifying…
☆17 · Dec 16, 2019 · Updated 6 years ago
zmy-9 / MF-DAKT
2021 CIKM: source code for the Multi-Factors Aware Dual-Attentional Knowledge Tracing (MF-DAKT)
☆16 · Apr 25, 2023 · Updated 2 years ago
songys / single_turn_dialogue
Data consisting only of example dialogue sentences extracted from dictionaries
☆16 · Apr 24, 2023 · Updated 2 years ago
jvparidon / lmerMultiMember
Multiple membership random effects. Wrapper around lme4::lmer and lme4::glmer.
☆22 · Oct 18, 2023 · Updated 2 years ago
hyunwoongko / beyond-lm
Beyond LM: How can language model go forward in the future?
☆15 · Apr 30, 2023 · Updated 2 years ago
jsoma / storytelling-2015
☆15 · Dec 10, 2015 · Updated 10 years ago
4g / dcool
data center cooling with reinforcement learning
☆15 · Jun 3, 2020 · Updated 5 years ago