Khan / tutoring-accuracy-dataset
This repository hosts the paper โLLM Based Math Tutoring: Challenges and Datasetโ, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
โ42Updated 7 months ago
Alternatives and similar repositories for tutoring-accuracy-dataset:
Users that are interested in tutoring-accuracy-dataset are comparing it to the libraries listed below
- ๐งฎ MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023โ51Updated last month
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Dataโ92Updated last week
- NAACL 2024. Code & Dataset for "๐ Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakeโฆโ37Updated 9 months ago
- โ93Updated 11 months ago
- โ33Updated 2 years ago
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutorsโ11Updated last week
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice Deโฆโ17Updated last year
- โ14Updated 11 months ago
- โ93Updated 4 months ago
- โ11Updated last year
- โ106Updated 11 months ago
- โ265Updated 3 months ago
- The Prism Alignment Projectโ75Updated last year
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Noveltyโ78Updated last year
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.โ85Updated 10 months ago
- Get answers to research questions from 200M+ papers. Link to demo -โ206Updated last year
- โ23Updated 2 years ago
- โ287Updated last year
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughtsโ59Updated last year
- โ35Updated 6 months ago
- A Computational Framework for Behavioral Assessment of LLM Therapistsโ27Updated 6 months ago
- โ68Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering datasetโ157Updated last year
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)โ209Updated this week
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersโ128Updated last year
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Modโฆโ36Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".โ158Updated 11 months ago
- An Education Tutoring Chatbot based on Learning Science Principles powered by Large Language Modelsโ50Updated 5 months ago
- Official repository for the AnnoMI dataset: the first public collection of expert-annotated MI transcripts.โ69Updated 2 years ago
- โ37Updated 5 months ago