ml6team / fondant
Production-ready data processing made easy and shareable
☆358 Updated last year
Alternatives and similar repositories for fondant
Users who are interested in fondant are comparing it to the libraries listed below.
- [WIP] A 🔥 interface for running code in the cloud ☆86 Updated 2 years ago
- ☆198 Updated last year
- Drop in replacement for OpenAI, but with Open models. ☆153 Updated 2 years ago
- Maybe the new state of the art vision model? we'll see 🤷‍♂️ ☆170 Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆233 Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆160 Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆121 Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆196 Updated last year
- 🤗🖼️ HuggingPics: Fine-tune Vision Transformers for anything using images found on the web. ☆310 Updated last year
- ☆51 Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆169 Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆334 Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆322 Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆158 Updated 2 years ago
- Smol but mighty language model ☆63 Updated 2 years ago
- 📚 Datasets and models for instruction-tuning ☆238 Updated 2 years ago
- Small finetuned LLMs for a diverse set of useful tasks ☆127 Updated 2 years ago
- Let's build better datasets, together! ☆267 Updated last year
- ☆469 Updated 2 years ago
- data cleaning and curation for unstructured text ☆328 Updated last year
- ☆172 Updated 10 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. ☆146 Updated last year
- Reimplementation of the task generation part from the Alpaca paper ☆119 Updated 2 years ago
- Inference code for Persimmon-8B ☆412 Updated 2 years ago
- ☆94 Updated 2 years ago
- Understanding large language models ☆120 Updated 2 years ago
- The repository for the code of the UltraFastBERT paper ☆520 Updated last year
- ☆125 Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models. ☆308 Updated 2 years ago
- Full finetuning of large language models without large memory requirements ☆94 Updated 3 months ago