taprosoft/llm_finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/taprosoft/llm_finetuning)

taprosoft / llm_finetuning

Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes)

☆143

Alternatives and similar repositories for llm_finetuning

Users that are interested in llm_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

midrender / haven
View on GitHub
LLM fine-tuning and eval
☆345Mar 21, 2024Updated 2 years ago
pytest-visual / pytest-visual
View on GitHub
A pytest plugin to organize and track algorithm visualizations
☆18Dec 1, 2024Updated last year
turboderp / exllama
View on GitHub
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,934Sep 30, 2023Updated 2 years ago
Nuggt-dev / Nuggt
View on GitHub
An Autonomous LLM Agent that runs on Wizcoder-15B
☆335Oct 21, 2024Updated last year
eugenepentland / landmark-attention-qlora
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Jun 16, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mzbac / qlora-fine-tune
View on GitHub
☆166Jun 1, 2023Updated 3 years ago
pickaxeproject / llama2
View on GitHub
Our Process for Llama2 Finetuning
☆16Sep 8, 2023Updated 2 years ago
soochan-lee / RoT
View on GitHub
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆45Jun 13, 2023Updated 3 years ago
ChrisHayduk / qlora-multi-gpu
View on GitHub
QLoRA with Enhanced Multi GPU Support
☆38Aug 8, 2023Updated 2 years ago
ghomasHudson / muld
View on GitHub
The Multitask Long Document Benchmark
☆42Nov 2, 2022Updated 3 years ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
mzbac / qlora-inference-multi-gpu
View on GitHub
☆14May 25, 2023Updated 3 years ago
Maximilian-Winter / guidance
View on GitHub
A guidance language for controlling large language models.
☆43Jun 9, 2023Updated 3 years ago
uukuguy / multi_loras
View on GitHub
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆162Feb 9, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bigcode-project / octopack
View on GitHub
🐙 OctoPack: Instruction Tuning Code Large Language Models
☆479Feb 5, 2025Updated last year
discus-labs / discus
View on GitHub
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
☆61Nov 20, 2023Updated 2 years ago
mzbac / AutoGPTQ-API
View on GitHub
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆91Jun 19, 2023Updated 3 years ago
ElleLeonne / Lightning-ReLoRA
View on GitHub
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆34Mar 2, 2024Updated 2 years ago
abacaj / fine-tune-mistral
View on GitHub
Fine-tune mistral-7B on 3090s, a100s, h100s
☆735Oct 11, 2023Updated 2 years ago
epfml / landmark-attention
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Dec 20, 2023Updated 2 years ago
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,242Updated this week
VatsaDev / NanoPhi-alpha
View on GitHub
GPT-2 small trained on phi-like data
☆68Feb 18, 2024Updated 2 years ago
OpenAccess-AI-Collective / servereless-runpod-ggml
View on GitHub
☆53Jun 11, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sasha0552 / vllm-ci
View on GitHub
CI scripts designed to build a Pascal-compatible version of vLLM.
☆13Aug 10, 2024Updated last year
severian42 / Proteus-The-Genesis-LLM
View on GitHub
Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine
☆25Dec 20, 2024Updated last year
rmihaylov / falcontune
View on GitHub
Tune any FALCON in 4-bit
☆462Sep 1, 2023Updated 2 years ago
nqchieutb01 / vietnamese-sentence-paraphase
View on GitHub
paraphase sentence
☆11Aug 22, 2025Updated 11 months ago
kanttouchthis / text_generation_webui_xtts
View on GitHub
XTTSv2 Extension for oobabooga text-generation-webui
☆157Nov 21, 2023Updated 2 years ago
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 10 months ago
ChobPT / oobaboogas-webui-langchain_agent
View on GitHub
Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work
☆73Sep 12, 2023Updated 2 years ago
kayvr / token-hawk
View on GitHub
WebGPU LLM inference tuned by hand
☆150Jun 24, 2023Updated 3 years ago
mzbac / GPTQ-for-LLaMa-API
View on GitHub
Provide a way to use the GPT-QLLama model as an API
☆44May 20, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cocktailpeanut / sdxl-turbo
View on GitHub
☆17Dec 18, 2023Updated 2 years ago
conglu1997 / intelligent-go-explore
View on GitHub
Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models
☆69Apr 16, 2026Updated 3 months ago
TengHu / AutoCoder
View on GitHub
☆11Jan 28, 2024Updated 2 years ago
turboderp-org / exui
View on GitHub
Web UI for ExLlamaV2
☆513Feb 5, 2025Updated last year
SensAI-PT / LLaMa2lang
View on GitHub
Convenience scripts to finetune (chat-)LLaMa3 and other models for any language
☆310Jun 17, 2024Updated 2 years ago
danikhan632 / guidance_api
View on GitHub
An Extension for oobabooga/text-generation-webui
☆37Jul 15, 2023Updated 3 years ago
dkruyt / webaisum
View on GitHub
WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …
☆15Apr 28, 2024Updated 2 years ago