codalab / codabench
Codabench is a flexible, easy-to-use and reproducible benchmarking platform. Check our paper at Patterns Cell Press https://hubs.li/Q01fwRWB0
☆130 Updated this week
Alternatives and similar repositories for codabench
Users who are interested in codabench are comparing it to the libraries listed below.
- Official Python client library for the OpenReview API ☆222 Updated this week
- A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management ☆156 Updated last year
- PyTorch Explain: Interpretable Deep Learning in Python. ☆168 Updated last year
- Visualizing query-key interactions in language + vision transformers (VIS 2023) ☆158 Updated last year
- Scrape papers from OpenReview using OpenReview API ☆61 Updated 10 months ago
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training. ☆93 Updated this week
- Training and evaluating NBM and SPAM for interpretable machine learning. ☆78 Updated 2 years ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆120 Updated 2 years ago
- Research on Tabular Foundation Models ☆68 Updated last year
- ☆238 Updated last month
- An open benchmarking platform for medical artificial intelligence using Federated Evaluation. ☆163 Updated 3 weeks ago
- ☆148 Updated 6 months ago
- Discovering Data-driven Hypotheses in the Wild ☆127 Updated 7 months ago
- Testing Language Models for Memorization of Tabular Datasets. ☆36 Updated 11 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind ☆72 Updated last year
- Dataset and modelling infrastructure for modelling "event streams": sequences of continuous time, multivariate events with complex intern… ☆115 Updated 6 months ago
- Updated code base for GlanceNets: Interpretable, Leak-proof Concept-based models ☆25 Updated 2 years ago
- A collection of AWESOME language modeling techniques on tabular data applications. ☆32 Updated last year
- W&B Server is the self hosted version of Weights & Biases ☆342 Updated last week
- ☆257 Updated last week
- TemporAI: ML-centric Toolkit for Medical Time Series ☆127 Updated 2 years ago
- Uncertainty-aware representation learning (URL) benchmark ☆106 Updated 10 months ago
- PAL: Predictive Analysis & Laws of Large Language Models ☆38 Updated last year
- End-to-End Ontology Learning with Large Language Models, NeurIPS 2024. ☆46 Updated last year
- [NeurIPS 2024] A holistic self-supervised data cleaning strategy to detect irrelevant samples, near duplicates and label errors. ☆36 Updated 3 months ago
- Tabular In-Context Learning ☆104 Updated 10 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆133 Updated last year
- Exca - Execution and caching tool for python ☆113 Updated last week
- TorchDR - PyTorch Dimensionality Reduction ☆183 Updated last week
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?" ☆31 Updated 7 months ago