☆219 · Feb 18, 2024 · Updated 2 years ago
Alternatives and similar repositories for LLM-finetuning-scripts
Users that are interested in LLM-finetuning-scripts are comparing it to the libraries listed below.
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters" ☆48 · Apr 12, 2023 · Updated 3 years ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems ☆151 · Jan 20, 2025 · Updated last year
- A place where I experiment with AI and share with a world ☆24 · Apr 17, 2024 · Updated 2 years ago
- Machine Learning Q and AI book ☆720 · Dec 17, 2025 · Updated 4 months ago
- A demo of the vanishing gradient problem in a simple fully connected network classifying MNIST images. ☆15 · Jan 16, 2018 · Updated 8 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆13,337 · Updated this week
- LoRA and DoRA from Scratch Implementations ☆220 · Mar 5, 2024 · Updated 2 years ago
- A RAG that can scale 🧑🏻💻 ☆11 · May 28, 2024 · Updated last year
- An LLM training library for instruction-tuning. ☆26 · Mar 4, 2024 · Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆61 · Jan 27, 2025 · Updated last year
- ☆25 · Apr 1, 2026 · Updated last month
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio… ☆16 · Aug 26, 2020 · Updated 5 years ago
- ☆22 · Jan 5, 2024 · Updated 2 years ago
- Collection of useful machine learning codes and snippets (originally intended for my personal use) ☆839 · Mar 21, 2024 · Updated 2 years ago
- Comparing four automatic image augmentation techniques in PyTorch: AutoAugment, RandAugment, AugMix, and TrivialAugment ☆31 · Feb 7, 2023 · Updated 3 years ago
- ☆170 · Jun 3, 2024 · Updated last year
- ☆78 · May 27, 2024 · Updated last year
- Building language models to predict more than one token ahead to enable further ahead predictions. ☆12 · May 22, 2025 · Updated 11 months ago
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training. ☆16 · Sep 18, 2024 · Updated last year
- Classification of WAV files from cats and dogs ☆21 · Dec 2, 2017 · Updated 8 years ago
- Demonstrations of the fundamental concepts from multivariable calculus ☆12 · Aug 21, 2018 · Updated 7 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,084 · Jul 1, 2025 · Updated 10 months ago
- fast.ai APL study group notes ☆29 · Oct 19, 2022 · Updated 3 years ago
- ☆20 · Mar 25, 2025 · Updated last year
- ☆10 · May 21, 2023 · Updated 2 years ago
- A lightweight reimplementation of some of the algorithms in the MEME suite in Python. ☆32 · Mar 17, 2026 · Updated last month
- ☆47 · Apr 6, 2024 · Updated 2 years ago
- Legal Entity Name Understanding ☆22 · Sep 25, 2025 · Updated 7 months ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Feb 17, 2023 · Updated 3 years ago
- A 4-hour coding workshop to understand how LLMs are implemented and used ☆1,086 · Jan 13, 2025 · Updated last year
- Code examples for lecture series ☆33 · Nov 4, 2024 · Updated last year
- Code for the paper "Language Models are Unsupervised Multitask Learners" ☆16 · Sep 21, 2024 · Updated last year
- Python command line tools as productivity supplements for Posix systems ☆17 · Apr 4, 2024 · Updated 2 years ago
- Code for "All-In-1: Short Text Classification with One Model for All Languages" - Plank (2017), IJCNLP 2017 shared task 4 ☆16 · Oct 26, 2017 · Updated 8 years ago
- A tiny library for coding with large language models. ☆1,233 · Jul 10, 2024 · Updated last year
- ☆11 · Feb 13, 2024 · Updated 2 years ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024 ☆247 · May 15, 2024 · Updated last year
- LLM (Large Language Model) FineTuning ☆575 · Apr 1, 2025 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆736 · Apr 10, 2024 · Updated 2 years ago