mlabonne / tinytuner
ππ§ A minimalistic tool to fine-tune your LLMs
β17Updated last year
Related projects β
Alternatives and complementary repositories for tinytuner
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β33Updated 8 months ago
- π¦ XβLLM: Simple & Cutting Edge LLM Finetuningβ11Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM datasetβ13Updated 8 months ago
- β24Updated last year
- implementation of https://arxiv.org/pdf/2312.09299β19Updated 4 months ago
- Explore the use of DSPy for extracting features from PDFs πβ33Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ37Updated 7 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zetaβ13Updated last week
- Training hybrid models for dummies.β15Updated 3 weeks ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β23Updated last week
- β41Updated 2 weeks ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptationsβ33Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked promptsβ23Updated 8 months ago
- β20Updated 9 months ago
- β32Updated last year
- Using multiple LLMs for ensemble Forecastingβ16Updated 10 months ago
- ππ€ A collection of templates for Hugging Face Spacesβ35Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.β20Updated 5 months ago
- Tutorial for DSPyβ21Updated 6 months ago
- A collection of pre-build wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and othersz!β20Updated last week
- Supervised instruction finetuning for LLM with HF trainer and Deepspeedβ34Updated last year
- BH hackathonβ14Updated 7 months ago
- Tools for merging pretrained large language models.β19Updated 5 months ago
- β32Updated 9 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.β43Updated 2 months ago
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't relβ¦β11Updated 9 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale β¦β11Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β61Updated 2 weeks ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrievalβ14Updated 10 months ago