ml6team / fondantLinks
Production-ready data processing made easy and shareable
☆358Updated last year
Alternatives and similar repositories for fondant
Users that are interested in fondant are comparing it to the libraries listed below
Sorting:
- ☆198Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Updated 11 months ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆514Updated 2 weeks ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆322Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆156Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- ☆94Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Tune MPTs☆84Updated 2 years ago
- Smol but mighty language model☆65Updated 2 years ago
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Let's build better datasets, together!☆269Updated last year
- Inference code for Persimmon-8B☆412Updated 2 years ago
- data cleaning and curation for unstructured text☆328Updated last year
- Full finetuning of large language models without large memory requirements☆94Updated 4 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.☆147Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆333Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- ☆472Updated 2 years ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆171Updated 2 years ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL.☆158Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont…☆72Updated last month
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆183Updated 3 months ago