evintunador / minLlama3
a simplified version of Meta's Llama 3 model to be used for learning
☆41Updated 11 months ago
Alternatives and similar repositories for minLlama3:
Users that are interested in minLlama3 are comparing it to the libraries listed below
- Micro Llama is a small Llama-based model with 300M parameters, trained from scratch on a $500 budget☆146Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆64Updated last year
- minimal GRPO implementation from scratch☆85Updated last month
- Notes and commented code for RLHF (PPO)☆86Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆157Updated 8 months ago
- Implementation of the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"☆135Updated 9 months ago
- Reference implementation of the Mistral AI 7B v0.1 model.☆28Updated last year
- Training and fine-tuning an LLM in Python and PyTorch.☆41Updated last year
- a simplified version of Google's Gemma model to be used for learning☆24Updated last year
- Distributed training (multi-node) of a Transformer model☆64Updated last year
- Building a 2.3M-parameter LLM from scratch with the LLaMA 1 architecture.☆158Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆74Updated 6 months ago
- Direct Preference Optimization from scratch in PyTorch☆90Updated 2 weeks ago
- Tutorial for how to build BERT from scratch☆92Updated 11 months ago
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆28Updated 2 months ago
- Official repository for ORPO☆448Updated 10 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆354Updated 7 months ago
- Notes about LLaMA 2 model☆59Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated 9 months ago
- Scripts for fine-tuning Llama2 via SFT and DPO.☆197Updated last year
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆116Updated 11 months ago
- [ICLR 2024] Family of LLMs for mathematical reasoning.☆261Updated 4 months ago
- An extension of the nanoGPT repository for training small MoE models.☆131Updated last month
- ☆255Updated last year
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction☆386Updated 9 months ago
- A compact LLM pretrained in 9 days using high-quality data☆310Updated 2 weeks ago
- ☆129Updated 8 months ago
- ☆219Updated 10 months ago
- An overview of GRPO and DeepSeek-R1 training, with open-source GRPO model fine-tuning☆31Updated 2 months ago