FlexEval is an LLM evaluation tool designed for practical quantitative analysis.
☆16Sep 19, 2025Updated 5 months ago
Alternatives and similar repositories for FlexEval
Users that are interested in FlexEval are comparing it to the libraries listed below
Sorting:
- Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors, EMNLP 2025 Oral☆32Nov 18, 2025Updated 3 months ago
- This is the data associated with the PERSUADE Corpus 2.0 version☆49Feb 3, 2026Updated last month
- ☆27Aug 20, 2021Updated 4 years ago
- ☆11Feb 2, 2024Updated 2 years ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆111Apr 19, 2025Updated 10 months ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Jul 21, 2024Updated last year
- normalizer of numerical / temporal expression☆11Sep 2, 2018Updated 7 years ago
- Package to estimate the grouping loss of a classifier, based on the paper "Beyond calibration: estimating the grouping loss of modern neu…☆11Dec 14, 2024Updated last year
- ☆14Oct 17, 2024Updated last year
- Retrieval augmented generation for middle-school math question answering and hint generation.☆43Feb 19, 2025Updated last year
- Novella is a build system for processing data in a temporary directory isolated from the project, designed for documentation source code …☆13Jan 11, 2024Updated 2 years ago
- Code of the paper "Beyond calibration: estimating the grouping loss of modern neural networks" published in ICLR 2023.☆12Nov 21, 2023Updated 2 years ago
- Code for experiments done for EMNLP2020.☆11Dec 8, 2022Updated 3 years ago
- Getting Started with Git and GitHub for R Users☆11Oct 29, 2019Updated 6 years ago
- Plurimath Ruby gem☆20Updated this week
- JIRA River Plugin for Elasticsearch☆24Apr 17, 2018Updated 7 years ago
- Image Tampering Detection WebApp☆12Jun 12, 2024Updated last year
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- text mining, regex, N-grams, fuzzy matching☆13Jan 22, 2021Updated 5 years ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- Liftkit design-system enhanced with Tailwind☆36Dec 14, 2025Updated 2 months ago
- SHAPR - An AI approach to predict 3D cell shapes from 2D microscopic images☆14May 31, 2023Updated 2 years ago
- ☆12Nov 28, 2023Updated 2 years ago
- ☆16Jun 5, 2017Updated 8 years ago
- A collection of R packages for educational datamining☆15Jan 14, 2019Updated 7 years ago
- Cracks numeric passwords of password-protected PDF files using bruteforce☆14May 15, 2025Updated 9 months ago
- Workshop for building generative AI applications from first principles☆30Dec 9, 2025Updated 2 months ago
- Translate darknet to tensorflow. Load trained weights, retrain/fine-tune using tensorflow, export constant graph def to mobile devices + …☆14Sep 7, 2019Updated 6 years ago
- A template for creating documentation using Astro's Content Collections API. It creates a Table of Contents based on the markdown files a…☆11Feb 21, 2023Updated 3 years ago
- Learning High-Quality and General-Purpose Phrase Representations. Findings of EACL 2024☆16Feb 29, 2024Updated 2 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 3 years ago
- Python wrapper for the FrameNet library.☆24Jul 26, 2011Updated 14 years ago
- RStudio notebooks for deep learning in R☆14Apr 27, 2020Updated 5 years ago
- ☆15Jan 2, 2022Updated 4 years ago
- ☆14Dec 4, 2024Updated last year
- ☆26Jan 11, 2019Updated 7 years ago
- Convert the data from stanford drone dataset to kitti format☆11Nov 20, 2017Updated 8 years ago
- IGI-Research-Data is repo that contain all the research information for Project I.G.I 1 game for educational purpose.☆21Oct 27, 2025Updated 4 months ago
- CRNN (CNN+RNN) for OCR using Keras / License Plate Recognition☆11Sep 11, 2020Updated 5 years ago