FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source models after Mistral. We will recreate its architecture in a simpler manner.
☆195Updated last year
Alternatives and similar repositories for Building-llama3-from-scratch
Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆195Updated last year
- Collection of resources for finetuning Large Language Models (LLMs).☆107Updated 11 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python☆74Updated 8 months ago
- Various installation guides for Large Language Models☆77Updated 8 months ago
- From scratch implementation of a vision language model in pure PyTorch☆254Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆344Updated last year
- LLM (Large Language Model) FineTuning☆567Updated 8 months ago
- Building LLaMA 4 MoE from Scratch☆70Updated 8 months ago
- Maximizing the Performance of a Simple RAG using RL☆87Updated 9 months ago
- An LLM cookbook for building your own from scratch, all the way from gathering data to training a model☆167Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 5 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆132Updated last year
- Notes and commented code for RLHF (PPO)☆120Updated last year
- ☆148Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆236Updated 3 months ago
- One click templates for inferencing Language Models☆222Updated last month
- A straightforward method for training your LLM, from downloading data to generating text.☆493Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆245Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆49Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆350Updated 6 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆118Updated 2 years ago
- minimal GRPO implementation from scratch☆101Updated 9 months ago
- LLaMA 2 implemented from scratch in PyTorch☆363Updated 2 years ago
- ☆267Updated 6 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆113Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- ☆103Updated 9 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆115Updated 7 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆141Updated 11 months ago
- Build datasets using natural language☆556Updated 3 months ago