pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆394 Updated last year
Alternatives and similar repositories for LLM-Workshop
Users that are interested in LLM-Workshop are comparing it to the libraries listed below
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆110 Updated last year
- Best practices for distilling large language models. ☆576 Updated last year
- Automatically evaluate your LLMs in Google Colab ☆661 Updated last year
- Official repository for ORPO ☆464 Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆256 Updated last year
- An open collection of methodologies to help with successful training of large language models. ☆512 Updated last year
- ☆541 Updated 10 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆126 Updated 2 years ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆274 Updated last year
- A bagel, with everything. ☆324 Updated last year
- Generate textbook-quality synthetic LLM pretraining data ☆505 Updated last year
- batched loras ☆346 Updated 2 years ago
- awesome synthetic (text) datasets ☆297 Updated 3 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆720 Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆189 Updated last year
- An open collection of implementation tips, tricks and resources for training large language models ☆481 Updated 2 years ago
- The official evaluation suite and dynamic data release for MixEval. ☆249 Updated 11 months ago
- Let's build better datasets, together! ☆262 Updated 9 months ago
- ☆216 Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆288 Updated 7 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM. ☆492 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆240 Updated 11 months ago
- Easily embed, cluster and semantically label text datasets ☆578 Updated last year
- ☆463 Updated last year
- A repository for research on medium sized language models. ☆511 Updated 4 months ago
- Collection of links, tutorials and best practices of how to collect the data and build end-to-end RLHF system to finetune Generative AI m… ☆223 Updated 2 years ago
- A comprehensive deep dive into the world of tokens ☆226 Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆311 Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 Updated 11 months ago
- distributed trainer for LLMs ☆580 Updated last year