ml6team / fondantLinks
Production-ready data processing made easy and shareable
☆358Updated last year
Alternatives and similar repositories for fondant
Users that are interested in fondant are comparing it to the libraries listed below
Sorting:
- ☆198Updated last year
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆154Updated 2 years ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- 🤖 A PyTorch library of curated Transformer models and their composable components☆894Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Updated last year
- Let's build better datasets, together!☆269Updated last year
- ☆125Updated last year
- Smol but mighty language model☆65Updated 2 years ago
- ☆94Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Updated 2 years ago
- Small finetuned LLMs for a diverse set of useful tasks☆127Updated 2 years ago
- ☆470Updated 2 years ago
- Completion After Prompt Probability. Make your LLM make a choice☆82Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Full finetuning of large language models without large memory requirements☆94Updated 4 months ago
- Tune MPTs☆84Updated 2 years ago
- Maybe the new state of the art vision model? we'll see 🤷♂️☆171Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆333Updated last year
- data cleaning and curation for unstructured text☆328Updated last year
- Explore and interpret large embeddings in your browser with interactive visualization! 📍☆512Updated last week
- AI Data Management & Evaluation Platform☆215Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- ☆416Updated 2 years ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆294Updated 10 months ago