ml6team / fondant
Production-ready data processing made easy and shareable
☆351 · Updated 10 months ago
Alternatives and similar repositories for fondant:
Users who are interested in fondant are comparing it to the libraries listed below.
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆198 · Updated 11 months ago
- 🤗 A PyTorch library of curated Transformer models and their composable components ☆883 · Updated last year
- ☆199 · Updated last year
- Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images. ☆275 · Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud ☆85 · Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models. ☆152 · Updated last year
- Let's build better datasets, together! ☆259 · Updated 3 months ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆317 · Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆320 · Updated 5 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆152 · Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆157 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 · Updated 5 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines ☆496 · Updated 3 weeks ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 · Updated 11 months ago
- Updated last month
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. ☆141 · Updated 10 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆273 · Updated last month
- ☆50 · Updated last year
- Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can gener… ☆208 · Updated 2 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi… ☆163 · Updated 2 years ago
- run paligemma in real time ☆131 · Updated 11 months ago
- Datasets and models for instruction-tuning ☆238 · Updated last year
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different… ☆286 · Updated 4 months ago
- ☆78 · Updated 10 months ago
- ☆451 · Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆254 · Updated 9 months ago
- Explore and interpret large embeddings in your browser with interactive visualization! ☆454 · Updated last year
- ☆167 · Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆167 · Updated last year