ml6team / fondant
Production-ready data processing made easy and shareable
☆337Updated 3 months ago
Related projects: ⓘ
- Domain Adapted Language Modeling Toolkit - E2E RAG☆295Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated 4 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆401Updated 7 months ago
- ☆201Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆237Updated last week
- ☆89Updated 11 months ago
- ☆409Updated 10 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆217Updated 6 months ago
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆545Updated 2 months ago
- Curate better data for LLMs☆934Updated 6 months ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆303Updated 9 months ago
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆267Updated last year
- 🤖 A PyTorch library of curated Transformer models and their composable components☆861Updated 5 months ago
- ☆154Updated 3 months ago
- ☆429Updated 8 months ago
- run paligemma in real time☆122Updated 4 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆154Updated 8 months ago
- Transform datasets at scale. Optimize datasets for fast AI model training.☆318Updated this week
- data cleaning and curation for unstructured text☆326Updated last month
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆330Updated this week
- Python client library for Modal☆268Updated this week
- This is our own implementation of 'Layer Selective Rank Reduction'☆229Updated 3 months ago
- ☆182Updated 7 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- Tune any FALCON in 4-bit☆469Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆180Updated 8 months ago
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web.☆277Updated 4 months ago
- 📚 Datasets and models for instruction-tuning☆228Updated 11 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆150Updated 5 months ago
- 🦖 X—LLM: Cutting Edge & Easy LLM Finetuning☆371Updated 8 months ago