An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
☆28Mar 2, 2026Updated 3 months ago
Alternatives and similar repositories for UnifyingAITutorEvaluation
Users that are interested in UnifyingAITutorEvaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- ☆24Jul 6, 2021Updated 4 years ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆84Sep 17, 2025Updated 8 months ago
- Python coherence evaluation tool using Stanford's CoreNLP.☆10Feb 2, 2020Updated 6 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆13Nov 22, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated last year
- ☆13Sep 27, 2022Updated 3 years ago
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral☆39Dec 11, 2025Updated 6 months ago
- A neural branch predictor tested using CPU emulator, testing both supervised learning and reinforcement learning (for COS 583: Great Mome…☆15May 17, 2017Updated 9 years ago
- Syntax-aware Word Mover’s Distance for Sentence Similarity Modeling☆20Nov 6, 2023Updated 2 years ago
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories☆24Apr 9, 2025Updated last year
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆19Mar 10, 2025Updated last year
- [ACL 2026 Main Conference] Paper list for the survey "A Survey of Deep Learning for Geometry Problem Solving"☆36Sep 14, 2025Updated 9 months ago
- Social Network Analysis and STEM Education is designed to prepare researchers to apply network analysis in order to better understand and…☆14Jul 14, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models☆24Jun 16, 2025Updated last year
- Ongoing research training Mixture of Expert models.☆22Sep 16, 2024Updated last year
- ☆14Sep 17, 2025Updated 8 months ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆17Nov 25, 2024Updated last year
- Code for the paper "Attention Temperature Matters in Abstractive Summarization Distillation"(https://arxiv.org/abs/2106.03441)☆13Mar 25, 2022Updated 4 years ago
- The TalkMoves Dataset: K-12 mathematics lesson transcripts annotated for teacher and student discursive moves☆37Feb 4, 2022Updated 4 years ago
- Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…☆12Jun 9, 2026Updated last week
- ☆12Nov 21, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.☆35Feb 12, 2025Updated last year
- Summarize a document conditioned on aspect keywords.☆17Sep 7, 2022Updated 3 years ago
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆17Aug 27, 2025Updated 9 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆33Aug 5, 2025Updated 10 months ago
- Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models…☆45Nov 25, 2025Updated 6 months ago
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Mar 3, 2025Updated last year
- Argotario: a multi-lingual serious game to tackle fallacious argumentation☆16Oct 14, 2025Updated 8 months ago
- The Chrome Experience User Survey (CUES) extension.☆12Sep 23, 2015Updated 10 years ago
- Easily turn large English text datasets into Japanese text datasets using open LLMs.☆29Jan 20, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- gloss browser for https://github.com/foss-np/np-l10n-glossary☆10Oct 29, 2017Updated 8 years ago
- An Obj-C/iOS web socket helper class, with the ability to send and receive data via simple delegate functions, and function calls. I also…☆11May 19, 2020Updated 6 years ago
- ACL 2022: Just Rank: Rethinking Evaluation with Word and Sentence Similarities☆35Dec 14, 2022Updated 3 years ago
- ☆40Feb 4, 2026Updated 4 months ago
- [EMNLP2022] Released code for paper "Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition"☆22Feb 9, 2023Updated 3 years ago
- 11-785 Group Project: YouShen Poetry generation☆10Dec 23, 2020Updated 5 years ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆26Dec 20, 2024Updated last year