An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
☆26 · Mar 2, 2026 · Updated this week
Alternatives and similar repositories for UnifyingAITutorEvaluation
Users who are interested in UnifyingAITutorEvaluation are comparing it to the repositories listed below
- Official code and data repository of MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte… ☆22 · Jun 3, 2024 · Updated last year
- ☆24 · Jul 6, 2021 · Updated 4 years ago
- This module is a tool for calculating correlations such as Partial, Tetrachoric, Intraclass correlation coefficients, Bootstrap agreement… ☆11 · Feb 16, 2026 · Updated 2 weeks ago
- Adaptive Cooperative Particle Swarm Optimizer ☆11 · Apr 26, 2016 · Updated 9 years ago
- ☆11 · Aug 23, 2023 · Updated 2 years ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆111 · Apr 19, 2025 · Updated 10 months ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization ☆13 · Mar 20, 2025 · Updated 11 months ago
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral ☆31 · Dec 11, 2025 · Updated 2 months ago
- The Universal Anaphora Scorer ☆15 · Sep 2, 2024 · Updated last year
- Tutorials for CMU's 2023 Generative AI Tutorial Series ☆11 · Jul 18, 2023 · Updated 2 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date ☆12 · Nov 22, 2020 · Updated 5 years ago
- An Obj-C/iOS web socket helper class, with the ability to send and receive data via simple delegate functions, and function calls. I also… ☆11 · May 19, 2020 · Updated 5 years ago
- The Chrome Experience User Survey (CUES) extension. ☆12 · Sep 23, 2015 · Updated 10 years ago
- ☆12 · Jun 2, 2025 · Updated 9 months ago
- Adaptation of SICSS lectures for CU Boulder site (August 13th - 17th) ☆12 · Sep 22, 2018 · Updated 7 years ago
- Code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis ☆12 · Nov 17, 2024 · Updated last year
- 🫧 Code for Holistic Reasoning with Long-Context LMs: A Benchmark for Database Operations on Massive Textual Data (Maekawa*, Iso* et al.… ☆12 · Feb 25, 2025 · Updated last year
- ☆12 · Feb 16, 2024 · Updated 2 years ago
- Python coherence evaluation tool using Stanford's CoreNLP. ☆10 · Feb 2, 2020 · Updated 6 years ago
- ☆13 · Sep 27, 2022 · Updated 3 years ago
- 11-785 Group Project: YouShen Poetry generation ☆10 · Dec 23, 2020 · Updated 5 years ago
- ☆11 · May 30, 2024 · Updated last year
- Paper: Relational Sentence Embedding for Flexible Semantic Matching ☆12 · May 22, 2024 · Updated last year
- Official implementation of “Watch Your Step: A Fine-Grained Evaluation Framework for Multi-hop Knowledge Editing in Large Language Models… ☆46 · Nov 25, 2025 · Updated 3 months ago
- Source code for ACL 2020 paper "A Span-based Linearization for Constituent Trees" ☆13 · Jan 12, 2022 · Updated 4 years ago
- Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, … ☆14 · Mar 15, 2022 · Updated 3 years ago
- PyTAIL - Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data ☆13 · Dec 3, 2022 · Updated 3 years ago
- 2019 PyCon kr tutorial: "Getting started with implementing NLP papers using Naver movie rating data" ☆13 · Aug 21, 2019 · Updated 6 years ago
- Language model for cancer domain ☆14 · Apr 7, 2022 · Updated 3 years ago
- NAEP Math Assessment Item Score Prediction Challenge (Spring 2023) ☆15 · Jun 8, 2023 · Updated 2 years ago
- ☆14 · Sep 17, 2025 · Updated 5 months ago
- ☆16 · Apr 29, 2020 · Updated 5 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization" ☆14 · Dec 26, 2022 · Updated 3 years ago