Every Eval Ever is a shared schema and crowdsourced eval database. It defines a standardized metadata format for storing AI evaluation results — from leaderboard scrapes and research papers to local evaluation runs — so that results from different frameworks can be compared, reproduced, and reused.
☆43Mar 21, 2026Updated this week
Alternatives and similar repositories for every_eval_ever
Users that are interested in every_eval_ever are comparing it to the libraries listed below
Sorting:
- 🤗 Tokenizers.js: A pure JS/TS implementation of today's most used tokenizers☆40Updated this week
- The repo consists of a Python package that works with functional data. In particular, it includes two distinct methodologies: Functional …☆13Sep 18, 2025Updated 6 months ago
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12May 18, 2017Updated 8 years ago
- ☆13Jul 13, 2025Updated 8 months ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences.☆12Mar 23, 2023Updated 2 years ago
- Proof Of Concept showcasing composable GPUs in Kubernetes☆18Mar 10, 2026Updated last week
- ☆13Mar 30, 2023Updated 2 years ago
- ☆18Jul 3, 2023Updated 2 years ago
- Self-personalizing LM☆76Mar 6, 2026Updated 2 weeks ago
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆24Sep 4, 2024Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆23Aug 29, 2022Updated 3 years ago
- This comprehensive collection of notebooks serves as a valuable resource for learners pursuing the IBM AI Engineering Professional Certi…☆15Dec 31, 2023Updated 2 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- Supplemental to A Probabilistic Grammar of Graphics☆14Dec 23, 2022Updated 3 years ago
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- Responsible Prompting is an LLM-agnostic tool that aims at dynamically supporting users in crafting prompts that embed responsible intent…☆45Jan 26, 2026Updated last month
- Lab files of IBM's Qiskit Global Summer School 2020.☆17Sep 3, 2020Updated 5 years ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆151Oct 2, 2025Updated 5 months ago
- Annotatability, a method to identify meaningful patterns in single-cell genomics data through annotation-trainability analysis, which est…☆19Jun 23, 2025Updated 8 months ago
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆19Oct 3, 2024Updated last year
- Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024☆18Mar 25, 2025Updated 11 months ago
- Example project showing how you can use your fast.ai based scripts to let Amazon SageMaker perform the training and hosting of your model…☆14Feb 20, 2019Updated 7 years ago
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 10 months ago
- AutoML system for building trustworthy peptide bioactivity predictors☆37Mar 2, 2026Updated 2 weeks ago
- Build a Docker container to build, train and deploy fast.ai based Deep Learning models with Amazon SageMaker☆13Dec 15, 2018Updated 7 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Jul 27, 2025Updated 7 months ago
- Repository for "Attribute First, then Generate: Locally-attributable Grounded Text Generation", ACL 2024☆30Dec 19, 2024Updated last year
- Official PyTorch implementation for ״ lassification-Regression for Chart Comprehension״☆26Feb 5, 2025Updated last year
- Deploy automl models for tabular tasks on AWS Sagemaker with AutoGluon☆13Feb 28, 2020Updated 6 years ago
- Ontology representing a 360-view of a person (or cohort) that spans across multiple domains, from health to social.☆35Sep 17, 2025Updated 6 months ago
- ☆22Jul 16, 2024Updated last year
- Progress component for Slidev☆19Apr 2, 2024Updated last year
- ☆39May 21, 2023Updated 2 years ago
- R and Data Files from my YouTube Channel☆29Aug 19, 2025Updated 7 months ago
- Example repo for how to organize your PhD with Github☆28Oct 25, 2015Updated 10 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆11Oct 23, 2023Updated 2 years ago
- A curated reading list of research in Sparse Autoencoders, Feature Extraction and related topics in Mechanistic Interpretability☆30Jan 30, 2025Updated last year
- TrustyAI Explainability Toolkit☆55Mar 2, 2026Updated 2 weeks ago
- ✒️ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.☆13Jan 6, 2023Updated 3 years ago