pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆363 Updated 7 months ago
Alternatives and similar repositories for LLM-Workshop:
Users who are interested in LLM-Workshop are comparing it to the repositories listed below.
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆101 Updated 4 months ago
- An Open Source Toolkit For LLM Distillation ☆442 Updated 3 weeks ago
- Official repository for ORPO ☆432 Updated 7 months ago
- Best practices for distilling large language models. ☆431 Updated 11 months ago
- ☆797 Updated this week
- Automatically evaluate your LLMs in Google Colab ☆583 Updated 8 months ago
- ☆489 Updated 2 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆123 Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆450 Updated 10 months ago
- A bagel, with everything. ☆315 Updated 9 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆255 Updated last year
- Generate textbook-quality synthetic LLM pretraining data ☆493 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆200 Updated 2 months ago
- awesome synthetic (text) datasets ☆256 Updated 3 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆694 Updated 4 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆687 Updated 9 months ago
- ☆497 Updated 5 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆581 Updated 10 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆249 Updated 6 months ago
- batched loras ☆338 Updated last year
- distributed trainer for LLMs ☆555 Updated 8 months ago
- Easily embed, cluster and semantically label text datasets ☆494 Updated 10 months ago
- Generative Representational Instruction Tuning ☆588 Updated last week
- Minimalistic large language model 3D-parallelism training ☆1,400 Updated this week
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning ☆347 Updated 4 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆1,022 Updated this week
- The official evaluation suite and dynamic data release for MixEval. ☆233 Updated 2 months ago
- Let's build better datasets, together! ☆250 Updated last month
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM. ☆431 Updated 5 months ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM ☆894 Updated this week