mlcommons / croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
☆499 Updated last week
Alternatives and similar repositories for croissant:
Users who are interested in croissant are comparing it to the libraries listed below.
- Inspect: A framework for large language model evaluations ☆746 Updated this week
- Website for hosting the Open Foundation Models Cheat Sheet. ☆263 Updated 7 months ago
- Create powerful Hydra applications without the yaml files and boilerplate code. ☆354 Updated this week
- Transform datasets at scale. Optimize datasets for fast AI model training. ☆406 Updated last week
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,723 Updated last month
- ☆206 Updated this week
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆380 Updated last week
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases. ☆791 Updated this week
- My personal frontpage app ☆82 Updated this week
- Interpretability for sequence generation models 🐛 🔍 ☆394 Updated 2 months ago
- A project for training a foundational Danish language model ☆70 Updated 3 weeks ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project. ☆539 Updated 8 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,169 Updated this week
- A visual labeling system implemented in Jupyter widgets. ☆150 Updated 2 months ago
- DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤 ☆914 Updated this week
- A Jax-based library for designing and training transformer models from scratch. ☆280 Updated 5 months ago
- ☆118 Updated 2 weeks ago
- The package used to build the documentation of our Hugging Face repos ☆99 Updated last week
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings. ☆106 Updated this week
- Shapley Interactions and Shapley Values for Machine Learning ☆314 Updated last week
- 📝 Automatically annotate papers using LLMs ☆281 Updated last month
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆63 Updated 11 months ago
- ☆259 Updated last week
- AI Data Management & Evaluation Platform ☆215 Updated last year
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆463 Updated this week
- Scalable data preprocessing and curation toolkit for LLMs ☆766 Updated this week
- Let's build better datasets, together! ☆250 Updated last month
- 📚 A curated list of papers & technical articles on AI Quality & Safety ☆165 Updated last year
- End-to-end Generative Optimization for AI Agents ☆458 Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,032 Updated this week