rasbt / LLM-finetuning-scripts
☆191 Updated 9 months ago
Related projects
Alternatives and complementary repositories for LLM-finetuning-scripts
- Sample notebooks and prompts for LLM evaluation ☆114 Updated this week
- A set of scripts and notebooks on LLM finetuning and dataset creation ☆93 Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆195 Updated 6 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆246 Updated 2 weeks ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023. ☆118 Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆66 Updated this week
- ☆75 Updated 5 months ago
- ☆162 Updated 5 months ago
- Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSw ☆181 Updated last year
- Materials for workshops on the Hugging Face ecosystem ☆150 Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆311 Updated last week
- ☆91 Updated last year
- Let's build better datasets, together! ☆205 Updated this week
- Best practices for distilling large language models. ☆397 Updated 9 months ago
- Large Language Model (LLM) Inference API and Chatbot ☆122 Updated 7 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 Updated 7 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data ☆469 Updated last year
- ☆47 Updated 5 months ago
- Highly commented implementations of Transformers in PyTorch ☆128 Updated last year
- An open collection of implementation tips, tricks and resources for training large language models ☆460 Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize and quote the most relevant papers related to the user query. ☆162 Updated 6 months ago
- Easily embed, cluster and semantically label text datasets ☆462 Updated 7 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆221 Updated 2 weeks ago
- experiments with inference on llama ☆105 Updated 5 months ago
- ☆200 Updated 9 months ago
- A comprehensive deep dive into the world of tokens ☆214 Updated 4 months ago
- Various installation guides for Large Language Models ☆53 Updated last week
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset. ☆315 Updated 3 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆81 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆100 Updated 9 months ago