FlexEval is an LLM evaluation tool designed for practical quantitative analysis.
☆16Sep 19, 2025Updated 6 months ago
Alternatives and similar repositories for FlexEval
Users that are interested in FlexEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Package to estimate the grouping loss of a classifier, based on the paper "Beyond calibration: estimating the grouping loss of modern neu…☆11Dec 14, 2024Updated last year
- Code of the paper "Beyond calibration: estimating the grouping loss of modern neural networks" published in ICLR 2023.☆12Nov 21, 2023Updated 2 years ago
- ☆27Aug 20, 2021Updated 4 years ago
- This is the data associated with the PERSUADE Corpus 2.0 version☆51Feb 3, 2026Updated last month
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral☆32Nov 18, 2025Updated 4 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆112Apr 19, 2025Updated 11 months ago
- normalizer of numerical / temporal expression☆11Sep 2, 2018Updated 7 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- Retrieval augmented generation for middle-school math question answering and hint generation.☆44Feb 19, 2025Updated last year
- ☆16Jun 5, 2017Updated 8 years ago
- GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts☆40Sep 30, 2025Updated 5 months ago
- JIRA River Plugin for Elasticsearch☆24Apr 17, 2018Updated 7 years ago
- 日本語テキストに対する wikification のためのソフトウェア☆17Mar 14, 2017Updated 9 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- ☆15Sep 24, 2018Updated 7 years ago
- The MOOC Replication Framework (MORF)☆18Aug 24, 2020Updated 5 years ago
- ☆22Jan 9, 2025Updated last year
- Getting Started with Git and GitHub for R Users☆11Oct 29, 2019Updated 6 years ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- ☆14Oct 17, 2024Updated last year
- Code for the paper "Exploring Knowledge Tracing in Tutor-Student Dialogues using LLMs" at LAK2025.☆28Feb 12, 2025Updated last year
- Specification language for generating Generalized Linear Models (with or without mixed effects) from conceptual models☆22Apr 26, 2022Updated 3 years ago
- moocRP: an open-source learning analytics Research Platform☆15Mar 11, 2016Updated 10 years ago
- Pipeline for Extracting and Organizing Procedural Information in Tutorial Videos☆24Nov 16, 2024Updated last year
- A simple tutorial on creating online books☆26Aug 11, 2021Updated 4 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- ☆11Feb 2, 2024Updated 2 years ago
- ☆12Nov 28, 2023Updated 2 years ago
- 《랭체인 완벽 입문》 예제 코드☆23Feb 7, 2024Updated 2 years ago
- Repository for the CommonLit Ease of Readability Corpus☆24Apr 17, 2024Updated last year
- A simple Dataset generator for Moving Mnist☆14May 26, 2023Updated 2 years ago
- ☆26Jan 11, 2019Updated 7 years ago
- Image Tampering Detection WebApp☆12Jun 12, 2024Updated last year
- 딥러닝 NLP☆30Feb 13, 2019Updated 7 years ago
- Research code for "Choosing to grow a graph" project. Contains code for network generation and model estimation.☆25Jul 31, 2020Updated 5 years ago
- SHAPR - An AI approach to predict 3D cell shapes from 2D microscopic images☆14May 31, 2023Updated 2 years ago
- Python wrapper for the FrameNet library.☆24Jul 26, 2011Updated 14 years ago
- ☆16Oct 24, 2023Updated 2 years ago