FareedKhan-dev / Building-llama3-from-scratchLinks
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆178Updated last year
Alternatives and similar repositories for Building-llama3-from-scratch
Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆182Updated last year
- Collection of resources for finetuning Large Language Models (LLMs).☆97Updated 7 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆226Updated 2 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆69Updated 4 months ago
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated last month
- A compact LLM pretrained in 9 days by using high quality data☆322Updated 4 months ago
- Building LLaMA 4 MoE from Scratch☆60Updated 4 months ago
- ☆262Updated 2 months ago
- ☆145Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆129Updated last year
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆331Updated 8 months ago
- Various installation guides for Large Language Models☆72Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆238Updated 9 months ago
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆459Updated 8 months ago
- Maximizing the Performance of a Simple RAG using RL☆79Updated 5 months ago
- One click templates for inferencing Language Models☆211Updated 3 weeks ago
- From scratch implementation of a vision language model in pure PyTorch☆235Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆145Updated last year
- LLM (Large Language Model) FineTuning☆559Updated 4 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆238Updated last year
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆148Updated 11 months ago
- GenAIOps on Kubernetes: A collection of reference architectures for running GenAI at scale on Kubernetes using OSS tooling☆132Updated 10 months ago
- ☆94Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated 3 months ago
- Unlock the potential of finetuning Large Language Models (LLMs). Learn from industry expert, and discover when to apply finetuning, data …☆65Updated last year
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆113Updated last year
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,A…☆375Updated last year
- ☆43Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated last year