muellerzr / minimal-trainer-zooLinks
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆196Updated last year
Alternatives and similar repositories for minimal-trainer-zoo
Users that are interested in minimal-trainer-zoo are comparing it to the libraries listed below
Sorting:
- Highly commented implementations of Transformers in PyTorch☆136Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 7 months ago
- ☆77Updated last year
- ☆170Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- experiments with inference on llama☆104Updated 11 months ago
- ☆92Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated 8 months ago
- ☆123Updated 7 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆122Updated last year
- ☆202Updated last year
- ☆198Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆136Updated last week
- ☆52Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆279Updated 3 months ago
- ☆152Updated 6 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- Large Language Model (LLM) Inference API and Chatbot☆125Updated last year
- Let's build better datasets, together!☆258Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf☆199Updated last year
- A miniture AI training framework for PyTorch☆42Updated 4 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- A comprehensive deep dive into the world of tokens☆223Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆258Updated 10 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆187Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆105Updated 2 months ago
- ☆77Updated 11 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year