FareedKhan-dev / create-million-parameter-llm-from-scratchView external linksLinks
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆197May 12, 2024Updated last year
Alternatives and similar repositories for create-million-parameter-llm-from-scratch
Users that are interested in create-million-parameter-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Aug 23, 2024Updated last year
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆75Aug 18, 2025Updated 5 months ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- ☆15Jan 26, 2026Updated 2 weeks ago
- Using the OpenAI Gym library, I implemented two reinforcement learning algorithms in the Frozen Lake environment.☆11Feb 10, 2024Updated 2 years ago
- Welcome to the Background Remover project! This tool allows you to effortlessly replace backgrounds in images and videos, making it perfe…☆11Feb 3, 2024Updated 2 years ago
- ☆12Mar 15, 2025Updated 10 months ago
- ☆14Apr 21, 2024Updated last year
- "What the teacher is, is more important than what he teaches."― Karl Menninger☆16Sep 10, 2021Updated 4 years ago
- playground for custom gpts built with agency-swarms (https://github.com/VRSEN/agency-swarm)☆14Jan 14, 2024Updated 2 years ago
- Implementation of 12 AI agents evaluation techniques☆35Jul 31, 2025Updated 6 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 4 months ago
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆36Sep 1, 2025Updated 5 months ago
- Intuitive RAG system on top of LllamaIndex☆15Nov 8, 2024Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆24Oct 13, 2025Updated 4 months ago
- Creating the DeepSeek V3 model from scratch☆24Mar 28, 2025Updated 10 months ago
- PaLM-Kosmos-Vision is a foundational project showcasing basic ChatGPT with vision capabilities, inviting further development for advanced…☆16Nov 15, 2023Updated 2 years ago
- Llama from scratch, or How to implement a paper without crying☆584May 29, 2024Updated last year
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated last year
- ☆23Jun 12, 2024Updated last year
- This repository contains end-to-end solutions for standard machine learning problems and problem statements shared in interviews☆23Mar 25, 2023Updated 2 years ago
- Run OpenDevin inside Docker☆24Jul 22, 2025Updated 6 months ago
- Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tu…☆22May 29, 2024Updated last year
- BlockchainGPT: An intuitive, chat-based platform to manage your blockchain environments using natural language processing capabilities.☆11Jul 6, 2023Updated 2 years ago
- ☆27Jun 16, 2023Updated 2 years ago