Khan / tutoring-accuracy-dataset
This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the performance and challenges of Large Language Models (LLMs) in math tutoring scenarios, providing a benchmark dataset for evaluating LLM accuracy in educational contexts.
☆28 · Updated 2 months ago
Related projects
Alternatives and complementary repositories for tutoring-accuracy-dataset
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆77 · Updated 3 months ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake… ☆29 · Updated 4 months ago
- Code for "Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies" ☆26 · Updated 7 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ☆45 · Updated 8 months ago
- An attribution library for LLMs ☆34 · Updated 2 months ago
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data… ☆76 · Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap ☆78 · Updated last month
- The Prism Alignment Project ☆37 · Updated 6 months ago
- Datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆62 · Updated last year
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De… ☆13 · Updated 7 months ago
- An Education Tutoring Chatbot based on Learning Science Principles powered by Large Language Models ☆42 · Updated 2 weeks ago
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations". ☆75 · Updated 10 months ago
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral … ☆45 · Updated 5 months ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆19 · Updated last year
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty ☆68 · Updated 7 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod… ☆29 · Updated 8 months ago
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24) ☆221 · Updated last week
- Documentation effort for the BookCorpus dataset ☆33 · Updated 3 years ago