uygarkurt / Llama-3-PyTorchLinks
☆32Updated 5 months ago
Alternatives and similar repositories for Llama-3-PyTorch
Users that are interested in Llama-3-PyTorch are comparing it to the libraries listed below
Sorting:
- ☆41Updated last month
- Unofficial implementation of https://arxiv.org/pdf/2407.14679☆45Updated 9 months ago
- ☆39Updated last month
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated last month
- ☆133Updated 10 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- Various installation guides for Large Language Models☆70Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆41Updated last year
- Collection of autoregressive model implementation☆85Updated 2 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- Building GPT ...☆18Updated 6 months ago
- ☆87Updated last year
- One click templates for inferencing Language Models☆190Updated last week
- Prune transformer layers☆69Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 8 months ago
- Quantization of LLMs and benchmarking.☆10Updated last year
- minimal GRPO implementation from scratch☆90Updated 3 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated 11 months ago
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆55Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 9 months ago
- ☆31Updated last year
- ☆32Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated last month
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated 10 months ago
- ☆27Updated 11 months ago