FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆161Updated 8 months ago
Alternatives and similar repositories for Building-llama3-from-scratch
Users that are interested in Building-llama3-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆164Updated last year
- From scratch implementation of a vision language model in pure PyTorch☆214Updated last year
- Maximizing the Performance of a Simple RAG using RL☆57Updated last month
- ☆112Updated 5 months ago
- This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)☆316Updated 4 months ago
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆126Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆230Updated 8 months ago
- ☆143Updated 9 months ago
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆82Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆217Updated 6 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆304Updated last month
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆59Updated last month
- Building LLaMA 4 MoE from Scratch☆43Updated last month
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instruct☆29Updated 2 months ago
- Simple example to showcase how to use llamaparser to parse PDF files☆84Updated 7 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated last month
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆137Updated 11 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"☆208Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆274Updated 10 months ago
- ☆72Updated last year
- How to build a Multi-Agentic Systems for RAG using LangGraph - Full project☆114Updated 4 months ago
- ☆42Updated last year
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆453Updated 4 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆110Updated 3 months ago
- ☆30Updated last week
- Various installation guides for Large Language Models☆69Updated 3 weeks ago
- Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…☆12Updated 7 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆203Updated 3 weeks ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated 3 months ago
- ☆65Updated 2 months ago