codalab / codabench
Codabench is a flexible, easy-to-use, and reproducible benchmarking platform. See our paper in Patterns (Cell Press): https://hubs.li/Q01fwRWB0
☆120 · Updated this week
Alternatives and similar repositories for codabench
Users who are interested in codabench are comparing it to the repositories listed below.
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training. ☆90 · Updated this week
- Official Python client library for the OpenReview API ☆206 · Updated this week
- Visualizing query-key interactions in language + vision transformers (VIS 2023) ☆155 · Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind ☆68 · Updated last year
- Scrape papers from OpenReview using OpenReview API ☆52 · Updated 8 months ago
- PAL: Predictive Analysis & Laws of Large Language Models ☆38 · Updated 9 months ago
- Discovering Data-driven Hypotheses in the Wild ☆114 · Updated 4 months ago
- PyTorch Explain: Interpretable Deep Learning in Python. ☆163 · Updated last year
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆121 · Updated last year
- Research on Tabular Foundation Models ☆58 · Updated 10 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents ☆106 · Updated last month
- An annotated implementation of the Hyena Hierarchy paper ☆34 · Updated 2 years ago
- ☆117 · Updated 2 years ago
- An open benchmarking platform for medical artificial intelligence using Federated Evaluation. ☆163 · Updated last month
- Decomposing and Editing Predictions by Modeling Model Computation ☆138 · Updated last year
- A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management 🚀 ☆147 · Updated last year
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆132 · Updated 11 months ago
- ☆230 · Updated this week
- Training and evaluating NBM and SPAM for interpretable machine learning. ☆78 · Updated 2 years ago
- Turn GitHub repositories into LLM tools. (ACL 2025) ☆53 · Updated 5 months ago
- NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records ☆47 · Updated last month
- Dataset and modelling infrastructure for modelling "event streams": sequences of continuous time, multivariate events with complex intern… ☆115 · Updated 4 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ☆106 · Updated 2 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆117 · Updated 4 months ago
- Official source code for "Graph Neural Networks for Learning Equivariant Representations of Neural Networks". In ICLR 2024 (oral). ☆83 · Updated last year
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning ☆24 · Updated 4 months ago
- ☆30 · Updated 2 years ago
- A toolkit for quantitative evaluation of data attribution methods. ☆53 · Updated 3 months ago
- ☆139 · Updated 2 years ago
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024) ☆45 · Updated 11 months ago