ml6team / fondantLinks
Production-ready data processing made easy and shareable
โ352Updated last year
Alternatives and similar repositories for fondant
Users that are interested in fondant are comparing it to the libraries listed below
Sorting:
- โ199Updated last year
- Diffusers-Interpret ๐ค๐งจ๐ต๏ธโโ๏ธ: Model explainability for ๐ค Diffusers. Get explanations for your generated images.โ277Updated 2 years ago
- data cleaning and curation for unstructured textโ327Updated 10 months ago
- โ203Updated last year
- Let's build better datasets, together!โ260Updated 6 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 linesโ197Updated last year
- [WIP] A ๐ฅ interface for running code in the cloudโ85Updated 2 years ago
- SpanMarker for Named Entity Recognitionโ434Updated 5 months ago
- โ415Updated last year
- Neural Searchโ358Updated 3 months ago
- just a bunch of useful embeddings for scikit-learn pipelinesโ500Updated 3 months ago
- โ92Updated last year
- โ455Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.โ309Updated 2 years ago
- Explore and interpret large embeddings in your browser with interactive visualization! ๐โ465Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...โ317Updated last year
- Inference code for Persimmon-8Bโ415Updated last year
- A Simple Bulk Labelling Toolโ587Updated 5 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAGโ322Updated 7 months ago
- FastFit โก When LLMs are Unfit Use FastFit โก Fast and Effective Text Classification with Many Classesโ208Updated last month
- ๐ค A PyTorch library of curated Transformer models and their composable componentsโ892Updated last year
- A library for detecting problematic data segments in structured and unstructured data with few lines of code.โ64Updated last year
- ๐ Datasets and models for instruction-tuningโ238Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.โ152Updated last year
- ๐ฆ An NLP application just for the lols: built with Haystack to get an overview of what a user is posting about on Twitterโ44Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).โฆโ121Updated 2 years ago
- โ170Updated last year
- ๐ Reference-Free automatic summarization evaluation with potential hallucination detectionโ100Updated last year
- Helpers and such for working with Lambda Cloudโ51Updated last year
- Understanding large language modelsโ117Updated 2 years ago