DreamGenX / DreamGenTrainLinks

☆14

Alternatives and similar repositories for DreamGenTrain

Users that are interested in DreamGenTrain are comparing it to the libraries listed below

Sorting:

Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105Updated last year
jeffrey-fong / Invoker
The one who calls upon functions - Function-Calling Language Model
☆36Updated last year
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆90Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆147Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated last year
Maximilian-Winter / AIRoleplay
Little AI roleplay program
☆58Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
PygmalionAI / training-code
The code we currently use to fine-tune models.
☆113Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆76Updated last month
cognitivecomputations / kraken
☆66Updated last year
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101Updated last year
s4rduk4r / alpaca_lora_4bit_readme
Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit
☆31Updated 2 years ago
FartyPants / Training_PRO
Traing PRO extension for oobabooga WebUI - recent dev version
☆50Updated last week
OoriData / OgbujiPT
Client-side toolkit for using large language models, including where self-hosted
☆111Updated 7 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆183Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221Updated last year
atisharma / chasm_engine
CHAracter State Management - a generative text adventure (engine)
☆65Updated 8 months ago
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆110Updated last year
mounta11n / plusplus-camall
After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…
☆54Updated 10 months ago
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21Updated last year