FareedKhan-dev / Building-llama3-from-scratchLinks
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆188Updated last year
Alternatives and similar repositories for Building-llama3-from-scratch
Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆189Updated last year
- Collection of resources for finetuning Large Language Models (LLMs).☆101Updated 9 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆73Updated 6 months ago
- From scratch implementation of a vision language model in pure PyTorch☆246Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 3 months ago
- Various installation guides for Large Language Models☆75Updated 6 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆129Updated last year
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆162Updated last year
- ☆264Updated 4 months ago
- Notes and commented code for RLHF (PPO)☆113Updated last year
- Building LLaMA 4 MoE from Scratch☆67Updated 6 months ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆70Updated 2 years ago
- minimal GRPO implementation from scratch☆98Updated 7 months ago
- Maximizing the Performance of a Simple RAG using RL☆82Updated 7 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆240Updated last year
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆203Updated this week
- LLM (Large Language Model) FineTuning☆564Updated 6 months ago
- ☆146Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆338Updated 10 months ago
- One click templates for inferencing Language Models☆215Updated 2 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆230Updated 3 weeks ago
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆478Updated 10 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆240Updated last year
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆455Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 5 months ago
- RAG-VectorDB-Embedings-LlamaIndex-Langchain☆266Updated last week
- Build datasets using natural language☆534Updated last month
- Automatically evaluate your LLMs in Google Colab☆660Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆455Updated 2 months ago
- ☆96Updated 7 months ago