muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆198Updated 10 months ago
Alternatives and similar repositories for minimal-trainer-zoo:
Users that are interested in minimal-trainer-zoo are comparing it to the libraries listed below
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 4 months ago
- Highly commented implementations of Transformers in PyTorch☆132Updated last year
- ☆165Updated 9 months ago
- ☆92Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆150Updated 3 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆105Updated 5 months ago
- ☆77Updated 9 months ago
- ☆120Updated 4 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆125Updated 3 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- ☆208Updated 8 months ago
- Let's build better datasets, together!☆256Updated 3 months ago
- ☆51Updated 9 months ago
- A miniture AI training framework for PyTorch☆39Updated last month
- ☆195Updated 10 months ago
- experiments with inference on llama☆104Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 11 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆265Updated 2 weeks ago
- ☆200Updated last year
- ☆199Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆253Updated 8 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Helpers and such for working with Lambda Cloud☆51Updated last year
- Large Language Model (LLM) Inference API and Chatbot☆124Updated 11 months ago
- Fast bare-bones BPE for modern tokenizer training☆149Updated 5 months ago
- Late Interaction Models Training & Retrieval☆259Updated this week
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆38Updated 11 months ago