FareedKhan-dev / train-llama4
Building LLaMA 4 MoE from Scratch
☆32Updated last week
Alternatives and similar repositories for train-llama4:
Users that are interested in train-llama4 are comparing it to the libraries listed below
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆52Updated 3 weeks ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆157Updated 8 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆31Updated 2 months ago
- Notebooks for fine tuning pali gemma☆100Updated last week
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆40Updated 2 months ago
- Maximizing the Performance of a Simple RAG using RL☆55Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 10 months ago
- minimal GRPO implementation from scratch☆85Updated last month
- Composition of Multimodal Language Models From Scratch☆14Updated 8 months ago
- ☆90Updated last month
- From scratch implementation of a vision language model in pure PyTorch☆213Updated 11 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆158Updated 11 months ago
- Notes and commented code for RLHF (PPO)☆86Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Updated last year
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more.☆130Updated this week
- Find your Twin Celebrity in Vector Space☆34Updated 3 months ago
- ☆111Updated 5 months ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆56Updated last year
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 7 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 5 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆126Updated last year
- ☆45Updated 3 weeks ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 3 weeks ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 11 months ago
- Building LLMs from scratch following the book from S. Raschka☆30Updated 3 weeks ago
- building a Large Language Model (LLM) from scratch.☆31Updated 2 months ago
- Various installation guides for Large Language Models☆69Updated this week
- ☆28Updated 5 months ago
- Qwen2 VL Fine Tuning using Llama Factory☆20Updated 7 months ago
- nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)☆100Updated last week