uygarkurt / Llama-3-PyTorch
☆33Updated 6 months ago
Alternatives and similar repositories for Llama-3-PyTorch
Users who are interested in Llama-3-PyTorch are comparing it to the libraries listed below
- LoRA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆111Updated 2 years ago
- ☆43Updated 2 months ago
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆46Updated 10 months ago
- ☆87Updated last year
- Various installation guides for Large Language Models☆71Updated 3 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated last month
- ☆43Updated 2 months ago
- ☆32Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆295Updated 2 years ago
- Distributed training (multi-node) of a Transformer model☆74Updated last year
- "Open Source Models with Hugging Face" course empowers you with the skills to leverage open-source models from the Hugging Face Hub for v…☆27Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆172Updated 11 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆110Updated 9 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- ML/DL Math and Method notes☆61Updated last year
- 👷 Build compute kernels☆78Updated this week
- ☆31Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆32Updated 3 months ago
- Accelerate Model Training with PyTorch 2.X, published by Packt☆46Updated last year
- ☆54Updated 5 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆123Updated 6 months ago
- Micro Llama is a small Llama-based model with 300M parameters, trained from scratch on a $500 budget☆153Updated this week
- Google TPU optimizations for transformers models☆117Updated 6 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆81Updated 2 months ago
- GPU Kernels☆191Updated 2 months ago
- From-scratch implementation of a vision language model in pure PyTorch☆228Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated 2 months ago