☆218Feb 18, 2024Updated 2 years ago
Alternatives and similar repositories for LLM-finetuning-scripts
Users that are interested in LLM-finetuning-scripts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Supplementary material for "Understanding Parameter-Efficient Finetuning of Large Language Models: From Prefix Tuning to Adapters"☆48Apr 12, 2023Updated 3 years ago
- Vision transformer finetuning scripts☆25Dec 8, 2023Updated 2 years ago
- ☆13Nov 19, 2023Updated 2 years ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,297Updated this week
- LoRA and DoRA from Scratch Implementations☆218Mar 5, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- An LLM training library for instruction-tuning.☆26Mar 4, 2024Updated 2 years ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆61Jan 27, 2025Updated last year
- ☆24Apr 1, 2026Updated last week
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆16Aug 26, 2020Updated 5 years ago
- Comparing four automatic image augmentation techniques in PyTorch: AutoAugment, RandAugment, AugMix, and TrivialAugment☆31Feb 7, 2023Updated 3 years ago
- ☆170Jun 3, 2024Updated last year
- ☆78May 27, 2024Updated last year
- Building language models to predict more than one token ahead to enable further ahead predictions.☆12May 22, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.☆16Sep 18, 2024Updated last year
- Repo for "Smart Word Suggestions" (SWS) task and benchmark☆19Dec 4, 2023Updated 2 years ago
- Demonstrations of the fundamental concepts from multivariable calculus☆12Aug 21, 2018Updated 7 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,079Jul 1, 2025Updated 9 months ago
- fast.ai APL study group notes☆29Oct 19, 2022Updated 3 years ago
- ☆10May 21, 2023Updated 2 years ago
- pandasGWAS: a Python package for easy retrieval of GWAS Catalog data☆17Dec 19, 2025Updated 3 months ago
- A lightweight reimplementation of some of the algorithms in the MEME suite in Python.☆32Mar 17, 2026Updated 3 weeks ago
- ☆25Jul 10, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆47Apr 6, 2024Updated 2 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,082Jan 13, 2025Updated last year
- WizardLM: Empowering Large Pre-Trained Language Models to Follow Complex Instructions☆20May 1, 2023Updated 2 years ago
- A Recommendation engine for an e-commerce use case that provides recommendations to users based on their purchase history.☆21Jul 10, 2021Updated 4 years ago
- Code for the paper "Language Models are Unsupervised Multitask Learners"☆16Sep 21, 2024Updated last year
- Website for Applied-LLMs work☆29Jan 13, 2026Updated 3 months ago
- Code for "All-In-1: Short Text Classification with One Model for All Languages" - Plank (2017), IJCNLP 2017 shared task 4☆16Oct 26, 2017Updated 8 years ago
- A tiny library for coding with large language models.☆1,233Jul 10, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Feb 13, 2024Updated 2 years ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024☆247May 15, 2024Updated last year
- ☆197May 5, 2024Updated last year
- Tuning the Finetuning: An exploration of achieving success with QLoRA☆46May 9, 2024Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆736Apr 10, 2024Updated 2 years ago
- Code for "Proposition-Level Clustering for Multi-Document Summarization" paper☆10Apr 5, 2024Updated 2 years ago
- Comparing performance across many methodological dimensions among tools that predict RNA after TF knockdowns and overexpression.☆22Sep 12, 2025Updated 7 months ago