mlabonne / tinytunerLinks
ππ§ A minimalistic tool to fine-tune your LLMs
β18Updated 2 years ago
Alternatives and similar repositories for tinytuner
Users that are interested in tinytuner are comparing it to the libraries listed below
Sorting:
- Using multiple LLMs for ensemble Forecastingβ16Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β34Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ72Updated last year
- Finetune any model on HF in less than 30 secondsβ56Updated last week
- Entailment self-trainingβ26Updated 2 years ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ46Updated last year
- QLoRA with Enhanced Multi GPU Supportβ37Updated 2 years ago
- ππ€ A collection of templates for Hugging Face Spacesβ35Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β32Updated 4 months ago
- Code for NeurIPS LLM Efficiency Challengeβ60Updated last year
- β56Updated last year
- Explore the use of DSPy for extracting features from PDFs πβ52Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Modelsβ70Updated 2 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.β90Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β69Updated 2 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β47Updated last year
- β42Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ23Updated last year
- π¨ Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.β50Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training dataβ32Updated last year
- β23Updated 2 years ago
- HuggingChat like UI in Gradioβ70Updated 2 years ago
- β125Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM datasetβ26Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPOβ29Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!β40Updated 2 years ago
- β66Updated this week
- β33Updated 2 years ago
- β20Updated 2 years ago