mlcommons / croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
★580 · Updated this week
Alternatives and similar repositories for croissant:
Users who are interested in croissant are comparing it to the libraries listed below.
- skops is a Python library helping you share your scikit-learn based models and put them in production ★477 · Updated this week
- Explore and interpret large embeddings in your browser with interactive visualization! ★454 · Updated last year
- Let's build better datasets, together! ★259 · Updated 4 months ago
- Interpretability for sequence generation models ★412 · Updated 5 months ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project. ★553 · Updated 10 months ago
- A list of awesome open source projects in the machine learning field, whose developers are mainly based in Germany ★43 · Updated 7 months ago
- Website for hosting the Open Foundation Models Cheat Sheet. ★267 · Updated last week
- Transform datasets at scale. Optimize datasets for fast AI model training. ★449 · Updated last week
- Toolkit for attaching, training, saving and loading of new heads for transformer models ★275 · Updated last month
- Create powerful Hydra applications without the yaml files and boilerplate code. ★375 · Updated last week
- The Foundation Model Transparency Index ★78 · Updated 11 months ago
- Scalable and Performant Data Loading ★240 · Updated this week
- Weave is a toolkit for developing AI-powered applications, built by Weights & Biases. ★860 · Updated this week
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024) ★415 · Updated 6 months ago
- A curated list of papers & technical articles on AI Quality & Safety ★178 · Updated last week
- just a bunch of useful embeddings for scikit-learn pipelines ★497 · Updated last month
- A PyTorch library of curated Transformer models and their composable components ★884 · Updated last year
- The package used to build the documentation of our Hugging Face repos ★110 · Updated last week
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) ★206 · Updated 2 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ★2,360 · Updated last week
- git extension for {collaborative, communal, continual} model development ★211 · Updated 5 months ago
- Inspect: A framework for large language model evaluations ★903 · Updated this week
- ★221 · Updated last month
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s… ★698 · Updated last month
- aim-mlflow integration ★209 · Updated last year
- Stanford NLP Python library for understanding and improving PyTorch models via interventions ★734 · Updated last week
- the scikit-learn sidekick ★403 · Updated this week
- ★128 · Updated 3 weeks ago
- TokenSHAP: Explain individual token importance in large language model prompts with SHAP values. Gain insights, debug models, detect bias… ★40 · Updated 3 weeks ago
- ★150 · Updated 8 months ago