mallik3006 / LLM_fine_tuning_llama3_8bLinks

Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed

☆19

Alternatives and similar repositories for LLM_fine_tuning_llama3_8b

Users that are interested in LLM_fine_tuning_llama3_8b are comparing it to the libraries listed below

Sorting:

daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78Updated last year
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆214Updated last year
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆99Updated 8 months ago
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
huggingface / transformers-research-projects
Research projects built on top of Transformers
☆100Updated 8 months ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆264Updated 11 months ago
FareedKhan-dev / train-llama4
Building LLaMA 4 MoE from Scratch
☆68Updated 7 months ago
NVIDIA / logits-processor-zoo
A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.
☆372Updated 4 months ago
huggingface / competitions
☆124Updated last year
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆242Updated last year
swj0419 / detect-pretrain-code-contamination
☆78Updated last year
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆111Updated last year
jshuadvd / LongRoPE
Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper
☆152Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated last year
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
nlp-uoregon / oregon_gpt_oss_patching
Efficient Finetuning for OpenAI GPT-OSS
☆22Updated last month
pacman100 / LLM-Workshop
LLM Workshop by Sourab Mangrulkar
☆395Updated last year
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆38Updated last year
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆314Updated 4 months ago
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆139Updated last year
pacman100 / openhathi_instruct
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…
☆23Updated last year
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆169Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆190Updated last year
apple / ml-hypercloning
☆52Updated last year
QuixiAI / spectrum
☆138Updated 2 months ago
Delve-ERAV1 / Phi-2-Vision-Language
Pretraining and finetuning for visual instruction following with Mixture of Experts
☆16Updated last year
UpstageAI / evalverse
The Universe of Evaluation. All about the evaluation for LLMs.
☆229Updated last year
hkproj / pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
☆117Updated 2 years ago
hamelsmu / llama-inference
experiments with inference on llama
☆103Updated last year
wjbmattingly / qwen2-vl-finetune-huggingface
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆77Updated 4 months ago