evintunador / minLlama3
a simplified version of Meta's Llama 3 model to be used for learning
☆41Updated 10 months ago
Alternatives and similar repositories for minLlama3:
Users who are interested in minLlama3 are comparing it to the repositories listed below.
- Notes and commented code for RLHF (PPO)☆77Updated last year
- This repository contains an implementation of the LLaMA 2 (Large Language Model Meta AI) model, a Generative Pretrained Transformer (GPT)…☆63Updated last year
- Micro Llama is a small Llama-based model with 300M parameters, trained from scratch on a $500 budget☆144Updated 11 months ago
- LLaMA 2 implemented from scratch in PyTorch☆307Updated last year
- Training and fine-tuning an LLM in Python and PyTorch.☆41Updated last year
- minimal GRPO implementation from scratch☆62Updated last week
- Train your own small bitnet model☆65Updated 5 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆83Updated this week
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆129Updated 8 months ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆448Updated 11 months ago
- [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning☆349Updated 6 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆105Updated 6 months ago
- Combining ViT and GPT-2 for image captioning. Trained on MS-COCO. The model was implemented mostly from scratch.☆42Updated last year
- Official repository for ORPO☆445Updated 9 months ago
- Distributed training (multi-node) of a Transformer model☆62Updated 11 months ago
- a simplified version of Google's Gemma model to be used for learning☆24Updated last year
- Implementation of DoRA☆291Updated 9 months ago
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆150Updated 7 months ago
- Official PyTorch implementation of QA-LoRA☆129Updated last year
- X-LoRA: Mixture of LoRA Experts☆214Updated 7 months ago
- [ACL 2024] Progressive LLaMA with Block Expansion.☆499Updated 10 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆165Updated 3 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024)☆205Updated 10 months ago
- An extension of the nanoGPT repository for training small MoE models.☆106Updated 2 weeks ago
- A project to improve the skills of large language models☆259Updated this week
- ☆220Updated 9 months ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention…☆287Updated 10 months ago
- Minimal hackable GRPO implementation☆187Updated last month
- Customizable GPT template code designed for easy experimentation with novel architectures☆26Updated last week
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆39Updated 3 weeks ago