Khan / tutoring-accuracy-dataset
This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
☆40Updated 7 months ago
Alternatives and similar repositories for tutoring-accuracy-dataset:
Users that are interested in tutoring-accuracy-dataset are comparing it to the libraries listed below
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆88Updated 7 months ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆35Updated 8 months ago
- ☆33Updated last year
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆50Updated 3 weeks ago
- ☆11Updated last year
- ☆23Updated last year
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆94Updated last month
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆10Updated 5 months ago
- ☆91Updated 10 months ago
- ☆21Updated 3 years ago
- ☆221Updated this week
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- ☆104Updated 10 months ago
- An Education Tutoring Chatbot based on Learning Science Principles powered by Large Language Models☆50Updated 4 months ago
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts☆58Updated last year
- Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"☆24Updated 2 years ago
- ☆36Updated 4 months ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆15Updated last year
- ☆35Updated 5 months ago
- ☆45Updated last week
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆35Updated last year
- ☆34Updated 5 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆70Updated 3 months ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Updated last year
- The Prism Alignment Project☆70Updated 11 months ago
- Official repository for the AnnoMI dataset: the first public collection of expert-annotated MI transcripts.☆66Updated 2 years ago
- Package to extract connotation frames☆83Updated last year
- ☆42Updated last year
- The official repo for SocKET: Social Knowledge Evaluation Tests☆23Updated last year
- ☆14Updated 10 months ago