tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆104Updated 5 months ago
Alternatives and similar repositories for llm_recipes:
Users that are interested in llm_recipes are comparing it to the libraries listed below
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 4 months ago
- ☆92Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated 9 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆255Updated last year
- Let's build better datasets, together!☆256Updated 2 months ago
- Set of scripts to finetune LLMs☆36Updated 11 months ago
- experiments with inference on llama☆104Updated 8 months ago
- LLM Workshop by Sourab Mangrulkar☆366Updated 8 months ago
- ☆113Updated 5 months ago
- The official evaluation suite and dynamic data release for MixEval.☆231Updated 3 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 4 months ago
- ☆87Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆264Updated last month
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆253Updated 7 months ago
- ☆21Updated 4 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 9 months ago
- awesome synthetic (text) datasets☆263Updated 4 months ago
- Sample notebooks and prompts for LLM evaluation☆120Updated 3 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆274Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 7 months ago
- LoRA and DoRA from Scratch Implementations☆197Updated 11 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆67Updated last year
- Prune transformer layers☆68Updated 9 months ago
- End-to-End LLM Guide☆103Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆203Updated 4 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆136Updated 7 months ago
- A miniture AI training framework for PyTorch☆39Updated last month