hunar4321 / reweight-gptLinks

Reweight GPT - a simple neural network using transformer architecture for next character prediction

☆58

Alternatives and similar repositories for reweight-gpt

Users that are interested in reweight-gpt are comparing it to the libraries listed below

Sorting:

TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆160Updated 2 years ago
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year
aigoopy / llm-jeopardy
Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts
☆108Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated 2 years ago
kyegomez / Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
☆150Updated last year
emrgnt-cmplxty / zero-shot-replication
☆73Updated 2 years ago
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆207Updated last year
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆74Updated 2 years ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆169Updated last year
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103Updated 5 months ago
LexiestLeszek / namegen
Self-contained, minimalistic implementation of a language model that generates coherent and normal sounding names. It uses an input datas…
☆51Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated 2 years ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145Updated 8 months ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆177Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆106Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last month
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
NolanoOrg / smol-gpt
Smol but mighty language model
☆62Updated 2 years ago
desik1998 / MathWithLLMs
☆49Updated last year
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated 11 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
huu4ontocord / MDEL
Multi-Domain Expert Learning
☆66Updated last year
ChuloAI / BrainChulo
Harnessing the Memory Power of the Camelids
☆147Updated 2 years ago
rafacelente / bllama
1.58-bit LLaMa model
☆83Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
skeskinen / llama-lite
Embeddings focused small version of Llama NLP model
☆105Updated 2 years ago
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆237Updated 2 years ago
aspctu / alpaca-lora
Instruct-tuning LLaMA on consumer hardware
☆65Updated 2 years ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated last year