Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral
☆37Nov 18, 2025Updated 7 months ago
Alternatives and similar repositories for mathtutorbench
Users that are interested in mathtutorbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral☆40Dec 11, 2025Updated 6 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆86Sep 17, 2025Updated 9 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆28Mar 2, 2026Updated 3 months ago
- ☆24Jul 6, 2021Updated 4 years ago
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Langu…☆16Apr 12, 2025Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆19Jun 25, 2024Updated 2 years ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆36Dec 24, 2025Updated 6 months ago
- Generating Teacher Student Interactions from Textbooks for Cost-Effective Development of Educational Chatbots, ACL 2024 Findings☆13Mar 27, 2025Updated last year
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- Askalot CQA System of Next Generation☆26Dec 14, 2022Updated 3 years ago
- development moved to https://github.com/myudelson/hmm-scalable☆38May 7, 2019Updated 7 years ago
- Repo for paper: https://arxiv.org/abs/2404.06479☆30Oct 3, 2024Updated last year
- ☆21Apr 16, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A simple Dataset generator for Moving Mnist☆14May 26, 2023Updated 3 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆23Mar 31, 2023Updated 3 years ago
- ☆15Jan 2, 2022Updated 4 years ago
- ☆23Aug 1, 2022Updated 3 years ago
- ☆26Apr 18, 2020Updated 6 years ago
- Code for spike coding networks☆16Jan 8, 2021Updated 5 years ago
- ☆27Jun 2, 2026Updated 3 weeks ago
- Data, code, and images for a posting summarizing three studies about pie charts☆15Jul 12, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 使用PyQt5+FastAPI+SQLAlchemy+Redis+Celery做的一个登录注册页,使用邮箱注册与验证,前后分离,其中:QtLoginRegistrationClient仓库存放 GUI,QtLoginRegistrationServer仓库存放 API☆11Mar 25, 2024Updated 2 years ago
- CEduMEval : A Chinese educational multi-task evaluation benchmark☆17Nov 18, 2024Updated last year
- 사전에서 대화 예문만 추출한 데이터☆16Apr 24, 2023Updated 3 years ago
- Race and Ethnicity based on name using data from census, voter reg. files, etc.☆11Jan 17, 2018Updated 8 years ago
- Code and dataset for the paper 'Optimized Prediction of Weapon Effectiveness in BVR Air Combat Scenarios Using Enhanced Regression Models…☆17Jun 29, 2025Updated last year
- Beyond LM: How can language model go forward in the future?☆15Apr 30, 2023Updated 3 years ago
- Model Context Protocol (MCP) server for go-zero framework - Generate APIs, RPC services, and models with AI assistance.☆44Jan 31, 2026Updated 5 months ago
- Unix stream tool using for Javascript and JSON☆16Feb 26, 2011Updated 15 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆24May 28, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper im…☆20Dec 5, 2023Updated 2 years ago
- 1-day R workshop for experienced users. First session covers script writing, file names and overall project structures. The second sessio…☆13Jan 25, 2019Updated 7 years ago
- Exploring classifier-free guidance in a DDPM language model for text generation towards emotion targets.☆11Sep 7, 2025Updated 9 months ago
- ☆36May 24, 2025Updated last year
- Memory-optimized training scripts for video models based on Diffusers☆17Jan 3, 2025Updated last year
- Repository to create easy and straightforward slides in Manim together with Manim-slides☆23Nov 26, 2025Updated 7 months ago
- Evaluation toolkit of the informative tracking benchmark comprising 9 scenarios, 180 diverse videos, and new challenges.☆17Dec 14, 2021Updated 4 years ago