uygarkurt / Llama-3-PyTorchLinks

☆33

Alternatives and similar repositories for Llama-3-PyTorch

Users that are interested in Llama-3-PyTorch are comparing it to the libraries listed below

Sorting:

hkproj / pytorch-lora
LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch
☆111Updated 2 years ago
hkproj / multi-latent-attention
☆43Updated 2 months ago
sayakpaul / hf-codegen
A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.
☆53Updated last year
alperiox / Compact-Language-Models-via-Pruning-and-Knowledge-Distillation
Unofficial implementation of https://arxiv.org/pdf/2407.14679
☆46Updated 10 months ago
geronimi73 / phi2-finetune
☆87Updated last year
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆71Updated 3 months ago
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated last month
kabir2505 / tiny-mixtral
☆43Updated 2 months ago
Blaizzy / Coding-LLMs-from-scratch
☆32Updated last year
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆47Updated last year
hkproj / transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
☆295Updated 2 years ago
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆74Updated last year
ksm26 / Open-Source-Models-with-Hugging-Face
"Open Source Models with Hugging Face" course empowers you with the skills to leverage open-source models from the Hugging Face Hub for v…
☆27Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆172Updated 11 months ago
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated 9 months ago
hkproj / mistral-src-commented
Reference implementation of Mistral AI 7B v0.1 model.
☆28Updated last year
stas00 / ml-ways
ML/DL Math and Method notes
☆61Updated last year
huggingface / kernel-builder
👷 Build compute kernels
☆78Updated this week
geronimi73 / mamba
☆31Updated last year
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated last month
axolotl-ai-cloud / grpo_code
A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆32Updated 3 months ago
PacktPublishing / Accelerate-Model-Training-with-PyTorch-2.X
Accelerate Model Training with PyTorch 2.X, published by Packt
☆46Updated last year
githubpradeep / notebooks
☆54Updated 5 months ago
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆123Updated 6 months ago
keeeeenw / MicroLlama
Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget
☆153Updated this week
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆117Updated 6 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆81Updated 2 months ago
1y33 / 100Days
GPU Kernels
☆191Updated 2 months ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆228Updated last year
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated 2 months ago