mlcommons / croissantLinks
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
☆787Updated this week
Alternatives and similar repositories for croissant
Users that are interested in croissant are comparing it to the libraries listed below
Sorting:
- Speed up model training by fixing data loading.☆574Updated this week
- skops is a Python library helping you share your scikit-learn based models and put them in production☆512Updated 3 weeks ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆512Updated last week
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.☆565Updated last year
- AI Data Management & Evaluation Platform☆215Updated 2 years ago
- Synthetic Data SDK ✨☆707Updated 2 weeks ago
- An interactive HTML pretty-printer for machine learning research in IPython notebooks.☆458Updated 5 months ago
- ML has an impact on the climate. But not all models are born equal. Compute your model's emissions with our calculator and add the result…☆250Updated last year
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.☆93Updated last week
- Creating beautiful plots of data maps☆973Updated 2 weeks ago
- Create powerful Hydra applications without the yaml files and boilerplate code.☆448Updated last week
- Neo: Hierarchical Confusion Matrix Visualization (CHI 2022)☆315Updated last month
- ☆259Updated 2 months ago
- A curated list of awesome open source tools and commercial products for ML Experiment Tracking and Management 🚀☆156Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆269Updated 8 months ago
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases.☆1,049Updated this week
- The Data Cards Playbook helps dataset producers and publishers adopt a people-centered approach to transparency in dataset documentation.☆198Updated last year
- Interactively explore unstructured datasets from your dataframe.☆1,245Updated last week
- Inspect: A framework for large language model evaluations☆1,712Updated this week
- The Foundation Model Transparency Index☆85Updated last month
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆737Updated this week
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,234Updated last week
- The robust European language model benchmark.☆156Updated this week
- just a bunch of useful embeddings for scikit-learn pipelines☆520Updated 4 months ago
- Scalable data pre processing and curation toolkit for LLMs☆1,377Updated this week
- A visual labeling system implemented in Jupyter widgets.☆154Updated last year
- A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evalu…☆197Updated 6 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,855Updated 7 months ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Updated last year
- Let's build better datasets, together!☆269Updated last year