hkproj / pytorch-llama-notes
Notes about LLaMA 2 model
☆54 · Updated last year
Alternatives and similar repositories for pytorch-llama-notes:
Users interested in pytorch-llama-notes are comparing it to the libraries listed below.
- LLaMA 2 implemented from scratch in PyTorch ☆307 · Updated last year
- A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods. ☆179 · Updated 2 months ago
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)… ☆63 · Updated last year
- Distributed training (multi-node) of a Transformer model ☆59 · Updated 11 months ago
- Awesome list for LLM quantization ☆186 · Updated 3 months ago
- Reference implementation of the Mistral AI 7B v0.1 model. ☆28 · Updated last year
- Implementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by DeepMind ☆91 · Updated last year
- Notes on the Mistral AI model ☆18 · Updated last year
- Unofficial implementation of the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models" ☆154 · Updated 9 months ago
- Notes and commented code for RLHF (PPO) ☆77 · Updated last year
- ☆41 · Updated 11 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024 ☆277 · Updated 3 weeks ago
- A family of compressed models obtained via pruning and knowledge distillation ☆329 · Updated 4 months ago
- ☆43 · Updated 4 months ago
- [ICML 2024] KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache ☆279 · Updated 2 months ago
- ☆136 · Updated 2 months ago
- The official implementation of the EMNLP 2023 paper LLM-FP4 ☆191 · Updated last year
- Code accompanying our publications on compression methods for transformers ☆416 · Updated 2 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed" ☆164 · Updated 3 months ago
- Notes on quantization in neural networks ☆77 · Updated last year
- Official PyTorch implementation of QA-LoRA ☆129 · Updated last year
- ☆145 · Updated last year
- Awesome list for LLM pruning ☆212 · Updated 3 months ago
- [ICML 2024] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models" ☆393 · Updated 5 months ago
- The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression" ☆121 · Updated 3 months ago
- LongRoPE is a novel method that can extend the context window of pre-trained LLMs to an impressive 2048k tokens. ☆209 · Updated 7 months ago
- ☆220 · Updated 9 months ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation ☆159 · Updated last year