uygarkurt / Llama-3-PyTorchLinks
☆41Updated last year
Alternatives and similar repositories for Llama-3-PyTorch
Users that are interested in Llama-3-PyTorch are comparing it to the libraries listed below
Sorting:
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆122Updated 2 years ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆197Updated last year
- ☆45Updated 8 months ago
- From scratch implementation of a vision language model in pure PyTorch☆254Updated last year
- ☆234Updated last year
- Learn the building blocks of how to build gpt-oss from scratch☆110Updated 4 months ago
- Accelerate Model Training with PyTorch 2.X, published by Packt☆51Updated last month
- ☆46Updated 8 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆334Updated 2 years ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆169Updated 5 months ago
- GPU Kernels☆218Updated 9 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- An extension of the nanoGPT repository for training small MOE models.☆233Updated 10 months ago
- Collection of autoregressive model implementation☆85Updated 2 weeks ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- LLaMA 2 implemented from scratch in PyTorch☆365Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆92Updated last year
- LoRA and DoRA from Scratch Implementations☆215Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆197Updated 8 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated 2 years ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeed☆19Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Various installation guides for Large Language Models☆77Updated 9 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆119Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- Recreating PyTorch from scratch (C/C++, CUDA, NCCL and Python, with multi-GPU support and automatic differentiation!)☆162Updated 2 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated last year
- 👷 Build compute kernels☆214Updated this week
- ☆137Updated last year
- ☆114Updated 4 months ago