DigitalHarborFoundation/FlexEval

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DigitalHarborFoundation/FlexEval)

DigitalHarborFoundation / FlexEval

FlexEval is an LLM evaluation tool designed for practical quantitative analysis.

☆16

Alternatives and similar repositories for FlexEval

Users that are interested in FlexEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arghosh / NeurIPSEducation2020
View on GitHub
☆28Aug 20, 2021Updated 4 years ago
NAEP-AS-Challenge / reading-prediction
View on GitHub
Information about the Challenge
☆21Jul 22, 2022Updated 4 years ago
kstats / CIMA
View on GitHub
☆24Jul 6, 2021Updated 5 years ago
kobauman / SULM
View on GitHub
☆16Jun 5, 2017Updated 9 years ago
EduNLP / edu-convokit
View on GitHub
Edu-ConvoKit: An Open-Source Framework for Education Conversation Data
☆115Apr 19, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
CAHLR / moocRP
View on GitHub
moocRP: an open-source learning analytics Research Platform
☆15Mar 11, 2016Updated 10 years ago
saghirb / Getting-Started-with-Git-and-GitHub-for-R-Users
View on GitHub
Getting Started with Git and GitHub for R Users
☆11Oct 29, 2019Updated 6 years ago
AngusGLChen / LearningQ
View on GitHub
Dataset and source code for LearningQ: A Large-scale Dataset for Educational Question Generation (ICWSM 2018).
☆65Jun 16, 2020Updated 6 years ago
jpcorb20 / bet-backtranslation-paraphrase-experiment
View on GitHub
Code for experiments done for EMNLP2020.
☆11Dec 8, 2022Updated 3 years ago
IEDMS / REDM
View on GitHub
A collection of R packages for educational datamining
☆15Jan 14, 2019Updated 7 years ago
elleobrien / typo_test
View on GitHub
☆12Nov 28, 2023Updated 2 years ago
tcapelle / torch_moving_mnist
View on GitHub
A simple Dataset generator for Moving Mnist
☆14May 26, 2023Updated 3 years ago
michelecafagna26 / vinvl-visualbackbone
View on GitHub
Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.
☆12Nov 27, 2022Updated 3 years ago
kstats / CausalQG
View on GitHub
☆15Apr 19, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
weilinie / GAN-QA
View on GitHub
Applying GANs in improving question generation and answering
☆12Oct 1, 2017Updated 8 years ago
chao1224 / SGNN-EBM
View on GitHub
Structured Multi-task Learning for Molecular Property Prediction, AISTATS'22 (https://proceedings.mlr.press/v151/liu22e.html)
☆14Jul 6, 2022Updated 4 years ago
arghosh / BOBCAT
View on GitHub
☆15Jan 2, 2022Updated 4 years ago
stanford-crfm / halie
View on GitHub
☆18Dec 11, 2023Updated 2 years ago
binwiederhier / sandclaude
View on GitHub
Run Claude in Docker with --dangerously-skip-permissions
☆22May 30, 2026Updated last month
gordon8214 / get-youtube-transcripts
View on GitHub
A Python script to retrieve plain text transcripts from YouTube videos
☆27Oct 30, 2016Updated 9 years ago
hyintell / RetrievalQA
View on GitHub
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆68May 28, 2024Updated 2 years ago
machenslab / spikes
View on GitHub
Code for spike coding networks
☆16Jan 8, 2021Updated 5 years ago
Willyoung2017 / PER-CHAT
View on GitHub
Personalized Response Generation via Generative Split Memory Network
☆12Sep 6, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
WolfNiu / polite-dialogue-generation
View on GitHub
Code for "Polite Dialogue Generation Without Parallel Data"
☆25Nov 24, 2018Updated 7 years ago
d-r-b-o-b / EightyFivePercentRule
View on GitHub
Code to replicate the figures from "The Eighty Five Percent Rule for Optimal Learning"
☆20Aug 19, 2019Updated 6 years ago
thuynh323 / Natural-language-processing
View on GitHub
text mining, regex, N-grams, fuzzy matching
☆13Jan 22, 2021Updated 5 years ago
zoezou2015 / text2math
View on GitHub
Code for the paper ``Text2Math: End-to-end Parsing Text into Math Expressions" accepted by EMNLP 2019
☆16Aug 20, 2019Updated 6 years ago
SparkDevF19 / ai-program-translation
View on GitHub
Program Translator AI built on Pytorch
☆15Dec 19, 2019Updated 6 years ago
jensjorisdecorte / Skill-Extraction-benchmark
View on GitHub
Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy.
☆17Jul 18, 2024Updated 2 years ago
marrlab / SHAPR
View on GitHub
SHAPR - An AI approach to predict 3D cell shapes from 2D microscopic images
☆17May 31, 2023Updated 3 years ago
open-compass / ProSA
View on GitHub
[EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs
☆29May 22, 2025Updated last year
Unity-Technologies / UGS-SQL-Cookbook
View on GitHub
A collection of common SQL queries and best practices for use with Unity Gaming Services Analytics
☆18May 26, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xbmxb / StructureCharacterization4DD
View on GitHub
https://openreview.net/forum?id=OC1o4_OI6Jw
☆13May 27, 2022Updated 4 years ago
lijierui / group-attention
View on GitHub
☆14May 28, 2019Updated 7 years ago
wikiabhi / Cifar-10
View on GitHub
Image Classification with Cifar-10 dataset
☆14Jul 19, 2018Updated 8 years ago
applicaai / CCpdf
View on GitHub
Index of URLs to pdf files all over the internet and scripts
☆25May 2, 2023Updated 3 years ago
ArtsEngine / concreteness
View on GitHub
concreteness ratings list
☆27Feb 22, 2017Updated 9 years ago
xxbidiao / plug-and-blend
View on GitHub
Codebase for public release of the plug-and-blend framework.
☆24Mar 29, 2022Updated 4 years ago
BorgwardtLab / WTK
View on GitHub
A Wasserstein Subsequence Kernel for Time Series.
☆21Jun 17, 2024Updated 2 years ago