Andrei-Aksionov / nanoGPTplus
☆47 · Updated last year
Alternatives and similar repositories for nanoGPTplus:
Users interested in nanoGPTplus are comparing it to the libraries listed below.
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆263 · Updated 7 months ago
- Experiments with inference on Llama ☆104 · Updated 11 months ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Updated 2 years ago
- Comprehensive analysis of the differences in performance between QLoRA, LoRA, and full finetunes (see the LoRA sketch after this list) ☆82 · Updated last year
- Inference code for mixtral-8x7b-32kseqlen ☆100 · Updated last year
- Minimal code to train a Large Language Model (LLM) ☆168 · Updated 2 years ago
- Experiments with generating open-source language model assistants ☆97 · Updated last year
- The Next Generation Multi-Modality Superintelligence ☆71 · Updated 8 months ago
- ☆199 · Updated last year
- Our open-source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆61 · Updated last year
- QLoRA with Enhanced Multi-GPU Support ☆37 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets ☆76 · Updated 6 months ago
- This repository contains the code for dataset curation and finetuning of the instruct variant of the Bilingual OpenHathi model. The resultin… ☆23 · Updated last year
- Supervised instruction finetuning for LLMs with the HF Trainer and DeepSpeed ☆35 · Updated last year
- Find the optimal model serving solution for 🤗 Hugging Face models 🚀 ☆43 · Updated last year
- Functional local implementations of the main model parallelism approaches ☆95 · Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 · Updated last year
- A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs ☆114 · Updated 2 years ago
- 📚 Datasets and models for instruction-tuning ☆237 · Updated last year
- Train your own small BitNet model (see the ternary quantization sketch after this list) ☆70 · Updated 6 months ago
- Minimal example scripts for the Hugging Face Trainer, focused on staying under 150 lines ☆198 · Updated last year
- ☆92 · Updated last year
- Exploring finetuning public checkpoints on filtered 8K sequences from the Pile ☆115 · Updated 2 years ago
- Drop-in replacement for OpenAI, but with open models ☆152 · Updated 2 years ago
- Experiments on speculative sampling with Llama models ☆126 · Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 · Updated 6 months ago
- Simple implementation of Speculative Sampling in NumPy for GPT-2 (see the sketch after this list) ☆95 · Updated last year
- TitanML Takeoff Server is an optimization, compression, and deployment platform that makes state-of-the-art machine learning models access… ☆114 · Updated last year
- Aana SDK is a powerful framework for building AI-enabled multimodal applications ☆47 · Updated this week
- Fine-tune LLMs in a few lines of code (Text2Text, Text2Speech, Speech2Text) ☆239 · Updated last year
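Several entries above center on parameter-efficient finetuning. As context for the QLoRA/LoRA comparison entry, here is a minimal sketch of a LoRA adapter layer, assuming PyTorch; the class name `LoRALinear` and the `rank`/`alpha` defaults are illustrative, not code from any repository listed. QLoRA follows the same recipe but stores the frozen base weights in a 4-bit quantized format.

```python
# Minimal LoRA sketch (illustrative, not from any repo above).
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base linear layer plus a trainable low-rank update B @ A."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)  # full weights stay frozen
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        # Low-rank factors: A projects down to `rank`, B projects back up.
        self.lora_A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scaling = alpha / rank  # standard LoRA scaling factor

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # y = base(x) + scaling * x A^T B^T
        return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

layer = LoRALinear(nn.Linear(768, 768), rank=8)
out = layer(torch.randn(2, 768))  # only lora_A and lora_B receive gradients
```

At inference time the update can be merged into the base weights (W ← W + scaling · B @ A), so the adapter adds no latency.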
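For the BitNet entry, this is a minimal sketch of the absmean ternary ("1.58-bit") weight quantization used by BitNet-b1.58-style models, assuming NumPy; the function names are illustrative. Each weight tensor is reduced to values in {−1, 0, +1} plus a single floating-point scale.

```python
# Illustrative absmean ternary quantization sketch (not from any repo above).
import numpy as np

def quantize_ternary(w: np.ndarray, eps: float = 1e-8):
    """Map full-precision weights to {-1, 0, +1} plus one scale per tensor."""
    scale = np.mean(np.abs(w)) + eps           # absmean scale
    w_q = np.clip(np.round(w / scale), -1, 1)  # ternary values
    return w_q, scale

def dequantize(w_q: np.ndarray, scale: float) -> np.ndarray:
    return w_q * scale

w = np.random.randn(4, 4).astype(np.float32)
w_q, s = quantize_ternary(w)
print(w_q)                                      # entries in {-1., 0., 1.}
print(np.max(np.abs(dequantize(w_q, s) - w)))   # quantization error
```

During training, such models typically keep full-precision latent weights and apply the quantizer on the forward pass with a straight-through estimator.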
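Finally, for the two speculative sampling entries, here is a compact NumPy sketch of the accept/reject scheme: a cheap draft model proposes k tokens, the target model accepts each with probability min(1, p/q), and on the first rejection a replacement token is sampled from the normalized residual max(p − q, 0). The toy `target_probs`/`draft_probs` functions are made-up stand-ins for real model calls; a real implementation scores all k draft positions in one target forward pass.

```python
# Illustrative speculative sampling sketch over a toy vocabulary.
import numpy as np

rng = np.random.default_rng(0)
VOCAB = 8

def target_probs(ctx):
    """Toy stand-in for the large target model: one distribution per context."""
    logits = np.sin(np.arange(VOCAB) + len(ctx))
    e = np.exp(logits - logits.max())
    return e / e.sum()

def draft_probs(ctx):
    """Toy stand-in for the small draft model: a smoothed copy of the target."""
    p = target_probs(ctx)
    q = 0.7 * p + 0.3 / VOCAB
    return q / q.sum()

def speculative_step(ctx, k=4):
    """Draft k tokens, then accept/reject them against the target distribution."""
    draft_ctx, drafts, qs = list(ctx), [], []
    for _ in range(k):
        q = draft_probs(draft_ctx)
        token = rng.choice(VOCAB, p=q)
        drafts.append(token)
        qs.append(q)
        draft_ctx.append(token)
    out = list(ctx)
    for token, q in zip(drafts, qs):
        p = target_probs(out)
        if rng.random() < min(1.0, p[token] / q[token]):
            out.append(token)                  # draft token accepted
        else:
            residual = np.maximum(p - q, 0.0)  # rejection: resample from max(p-q, 0)
            out.append(rng.choice(VOCAB, p=residual / residual.sum()))
            return out
    out.append(rng.choice(VOCAB, p=target_probs(out)))  # all accepted: bonus token
    return out

ctx = [0]
for _ in range(5):
    ctx = speculative_step(ctx)
print(ctx)  # grows by 1 to k+1 tokens per step
```

Because accepted tokens are provably distributed according to the target model, this trades extra draft-model work for fewer expensive target-model decoding steps.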