FareedKhan-dev / Building-llama3-from-scratchLinks

LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.

☆175

Alternatives and similar repositories for Building-llama3-from-scratch

Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below

Sorting:

FareedKhan-dev / create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆181Updated last year
pdaicode / awesome-LLMs-finetuning
Collection of resources for finetuning Large Language Models (LLMs).
☆93Updated 6 months ago
FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python
☆69Updated 4 months ago
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆234Updated last year
shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆148Updated last year
TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆203Updated this week
marklysze / LlamaIndex-RAG-WSL-CUDA
Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B
☆129Updated last year
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆72Updated 3 months ago
FareedKhan-dev / rag-with-rl
Maximizing the Performance of a Simple RAG using RL
☆70Updated 4 months ago
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆313Updated 3 weeks ago
Decentralised-AI / LFM-Liquid-AI-Liquid-Foundation-Models
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆103Updated 10 months ago
FudanDNN-NLP / RAG
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)
☆329Updated 7 months ago
apple / ml-superposition-prompting
☆145Updated last year
Curated-Awesome-Lists / awesome-llms-fine-tuning
Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…
☆448Updated 8 months ago
spcl / MRAG
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆222Updated last month
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆235Updated 11 months ago
ali-bahrainian / RAG_best_practices
☆93Updated 4 months ago
arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆230Updated 9 months ago
ibm-granite / granite-3.0-language-models
☆261Updated last month
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆94Updated 4 months ago
rohan-paul / LLM-FineTuning-Large-Language-Models
LLM (Large Language Model) FineTuning
☆551Updated 4 months ago
FareedKhan-dev / train-llama4
Building LLaMA 4 MoE from Scratch
☆60Updated 3 months ago
ictnlp / Auto-RAG
This is the official repository for Auto-RAG.
☆218Updated 3 weeks ago
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆276Updated last year
myeon9h / PlanRAG
Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24
☆142Updated last year
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated 2 months ago
gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…
☆191Updated this week
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆320Updated 4 months ago
huggingface / gpt-oss-recipes
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
☆222Updated this week
microsoft / llm-data-creation
Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"
☆135Updated last year