ml6team / fondantLinks
Production-ready data processing made easy and shareable
☆354Updated last year
Alternatives and similar repositories for fondant
Users that are interested in fondant are comparing it to the libraries listed below
Sorting:
- ☆199Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆159Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆282Updated 4 months ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆152Updated 2 years ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆324Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 8 months ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Let's build better datasets, together!☆260Updated 6 months ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆165Updated last year
- ☆124Updated 8 months ago
- ☆92Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆317Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆310Updated 2 years ago
- batched loras☆344Updated last year
- ☆154Updated 7 months ago
- Tune MPTs☆84Updated 2 years ago
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs☆114Updated 2 years ago
- ☆460Updated last year
- git extension for {collaborative, communal, continual} model development☆214Updated 8 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- Place where folks can contribute to 🤗 community events☆424Updated last year
- ☆416Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- experiments with inference on llama☆104Updated last year
- 📚 Datasets and models for instruction-tuning☆238Updated last year
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆153Updated last year