FareedKhan-dev / Building-llama3-from-scratchLinks
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆200Updated last year
Alternatives and similar repositories for Building-llama3-from-scratch
Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆130Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Updated 10 months ago
- From scratch implementation of a vision language model in pure PyTorch☆254Updated last year
- Maximizing the Performance of a Simple RAG using RL☆90Updated 10 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- ☆147Updated last year
- ☆107Updated 10 months ago
- This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview☆98Updated last year
- One click templates for inferencing Language Models☆228Updated 2 months ago
- ☆92Updated last week
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆502Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆116Updated last year
- Various installation guides for Large Language Models☆77Updated 9 months ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆212Updated 3 months ago
- a simplified version of Meta's Llama 3 model to be used for learning☆44Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆239Updated 4 months ago
- Collection of resources for finetuning Large Language Models (LLMs).☆111Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 6 months ago
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆31Updated 11 months ago
- LLM (Large Language Model) FineTuning☆565Updated 10 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆336Updated 2 years ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆122Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch☆366Updated 2 years ago
- Building LLaMA 4 MoE from Scratch☆72Updated 9 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆250Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆512Updated 6 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- Automatically evaluate your LLMs in Google Colab☆685Updated last year