mlcommons / croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
☆608 · Updated this week
Alternatives and similar repositories for croissant
Users who are interested in croissant are comparing it to the repositories listed below.
- Transform datasets at scale. Optimize datasets for fast AI model training. ☆472 · Updated this week
- An example starter repo for Python projects ☆288 · Updated last month
- SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders. ☆762 · Updated last month
- Stanford NLP Python library for understanding and improving PyTorch models via interventions ☆742 · Updated 2 weeks ago
- Inspect: A framework for large language model evaluations ☆938 · Updated this week
- Website for hosting the Open Foundation Models Cheat Sheet. ☆267 · Updated last week
- AI Data Management & Evaluation Platform ☆215 · Updated last year
- Explore and interpret large embeddings in your browser with interactive visualization! 📍 ☆455 · Updated last year
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) ☆209 · Updated last month
- An interactive HTML pretty-printer for machine learning research in IPython notebooks. ☆414 · Updated 2 weeks ago
- 📝 Automatically annotate papers using LLMs ☆320 · Updated 3 weeks ago
- LLM Comparator is an interactive data visualization tool for evaluating and analyzing LLM responses side-by-side, developed by the PAIR t… ☆423 · Updated 3 months ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆490 · Updated 11 months ago
- git extension for {collaborative, communal, continual} model development ☆213 · Updated 6 months ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024) ☆419 · Updated 7 months ago
- Data and tools for generating and inspecting OLMo pre-training data. ☆1,214 · Updated this week
- The WeightWatcher tool for predicting the accuracy of Deep Neural Networks ☆1,592 · Updated 8 months ago
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project. ☆559 · Updated 11 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks. ☆1,779 · Updated 3 weeks ago
- ☆233 · Updated last month
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆278 · Updated 2 months ago
- Create powerful Hydra applications without the yaml files and boilerplate code. ☆377 · Updated last week
- awesome synthetic (text) datasets ☆281 · Updated 6 months ago
- Synthetic Data SDK ✨ ☆504 · Updated this week
- Interpretability for sequence generation models 🐛 🔍 ☆413 · Updated 3 weeks ago
- Visualizing query-key interactions in language + vision transformers ☆144 · Updated last year
- skops is a Python library helping you share your scikit-learn based models and put them in production ☆480 · Updated 3 weeks ago
- ☆267 · Updated 3 months ago
- ☆129 · Updated last month
- Train Models Contrastively in Pytorch ☆702 · Updated last month