kaushal0494/UnifyingAITutorEvaluation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kaushal0494/UnifyingAITutorEvaluation)

kaushal0494 / UnifyingAITutorEvaluation

An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors

☆29

Alternatives and similar repositories for UnifyingAITutorEvaluation

Users that are interested in UnifyingAITutorEvaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rosewang2008 / bridge
View on GitHub
NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…
☆46Jul 21, 2024Updated 2 years ago
Zhenwen-NLP / MathChat
View on GitHub
Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…
☆22Jun 3, 2024Updated 2 years ago
google-research-datasets / Education-Dialogue-Dataset
View on GitHub
Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher…
☆36Oct 29, 2024Updated last year
copenlu / awesome-text-interpretability
View on GitHub
A repo to keep all resources about interpretability in NLP organised and up to date
☆13Nov 22, 2020Updated 5 years ago
ddemszky / classroom-transcript-analysis
View on GitHub
☆43Jun 15, 2026Updated last month
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
adrianeboyd / BrillMooreSpellChecker
View on GitHub
Spell checker using Brill and Moore's noisy channel error model
☆13Jan 9, 2019Updated 7 years ago
eth-lre / PedagogicalRL
View on GitHub
Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral
☆42Dec 11, 2025Updated 7 months ago
BinWang28 / FacEval
View on GitHub
EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization
☆13Mar 20, 2025Updated last year
JiwooKimAR / dmath
View on GitHub
☆12Feb 16, 2024Updated 2 years ago
ymgw55 / WSMD
View on GitHub
Improving word mover’s distance by leveraging self-attention matrix (Published in EMNLP 2023 Findings)
☆10Mar 10, 2026Updated 4 months ago
yumoxu / oreo
View on GitHub
☆13Sep 27, 2022Updated 3 years ago
otacke / h5p-essay
View on GitHub
experimental H5P content for automated feedback on texts
☆17Jul 10, 2026Updated 2 weeks ago
laser-institute / network-analysis
View on GitHub
Social Network Analysis and STEM Education is designed to prepare researchers to apply network analysis in order to better understand and…
☆15Jul 14, 2025Updated last year
thu-coai / LongSafety
View on GitHub
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16Jun 18, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BinWang28 / RSE
View on GitHub
Paper: Relational Sentence Embedding for Flexible Semantic Matching
☆12May 22, 2024Updated 2 years ago
WHGTyen / BIG-Bench-Mistake
View on GitHub
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆89Aug 10, 2024Updated last year
megagonlabs / holobench
View on GitHub
🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.…
☆12Feb 25, 2025Updated last year
zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
seopbo / first_project_nlp
View on GitHub
2019 PyCon kr tutorial: "네이버 영화 평점 데이터로 자연어처리 논문 구현 시작하기"
☆13Aug 21, 2019Updated 6 years ago
EduNLP / edu-convokit
View on GitHub
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
☆115Apr 19, 2025Updated last year
lukas / otto
View on GitHub
☆29May 7, 2024Updated 2 years ago
tcapelle / torch_moving_mnist
View on GitHub
A simple Dataset generator for Moving Mnist
☆14May 26, 2023Updated 3 years ago
conceptmath / conceptmath
View on GitHub
[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …
☆26May 29, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
feilaz / AI_Powered_Math_Tutoring
View on GitHub
This repository contains code used in "AI-Powered Math Tutoring: Platform for Personalized and Adaptive Education" paper.
☆18Feb 25, 2025Updated last year
Strong-AI-Lab / ChatLogic
View on GitHub
☆16Dec 17, 2023Updated 2 years ago
dreamtheater123 / VoxEval
View on GitHub
Github repository for ACL 2025 paper: VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models
☆24Jun 16, 2025Updated last year
IBM / bridging-resolution
View on GitHub
☆14Sep 17, 2025Updated 10 months ago
okoge-kaz / moe-recipes
View on GitHub
Ongoing research training Mixture of Expert models.
☆22Sep 16, 2024Updated last year
LLM-MI-Research / Actionable-MI
View on GitHub
☆15Jan 20, 2026Updated 6 months ago
MANGA-UOFA / NAUS
View on GitHub
Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"
☆14Dec 26, 2022Updated 3 years ago
kaistAI / Knowledge-Entropy
View on GitHub
[ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition
☆17Nov 25, 2024Updated last year
hyunsooseol / seolmatrix
View on GitHub
This module is a tool for calculating correlations such as Partial, Tetrachoric, Intraclass correlation coefficients, Canonical correlati…
☆13Jun 17, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
montehoover / DynaGuard
View on GitHub
Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."
☆23Nov 3, 2025Updated 8 months ago
ArgLab / writing_observer
View on GitHub
Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…
☆12Updated this week
binwiederhier / sandclaude
View on GitHub
Run Claude in Docker with --dangerously-skip-permissions
☆22May 30, 2026Updated last month
microsoft / MetaXL
View on GitHub
Meta Representation Transformation for Low-resource Cross-lingual Learning
☆41May 5, 2021Updated 5 years ago
ybai-nlp / EduBench
View on GitHub
Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…
☆25May 28, 2025Updated last year
science-of-finetuning / sparsity-artifacts-crosscoders
View on GitHub
Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.
☆17Jul 6, 2026Updated 2 weeks ago
UKPLab / argotario
View on GitHub
Argotario: a multi-lingual serious game to tackle fallacious argumentation
☆16Oct 14, 2025Updated 9 months ago