Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆202May 12, 2024Updated last year
Alternatives and similar repositories for create-million-parameter-llm-from-scratch
Users that are interested in create-million-parameter-llm-from-scratch are comparing it to the libraries listed below
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆204Aug 23, 2024Updated last year
- Understanding Large Language Transformer Architecture like a child☆28Apr 3, 2024Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆524Aug 3, 2025Updated 7 months ago
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆77Aug 18, 2025Updated 6 months ago
- ☆15Updated this week
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated 11 months ago
- Welcome to the Background Remover project! This tool allows you to effortlessly replace backgrounds in images and videos, making it perfe…☆11Feb 3, 2024Updated 2 years ago
- A full-stack web chatbot application integrated with Ollama☆12Jul 31, 2024Updated last year
- Using the OpenAI Gym library, I implemented two reinforcement learning algorithms in the Frozen Lake environment.☆11Feb 10, 2024Updated 2 years ago
- Implementation of 12 AI agents evaluation techniques☆36Jul 31, 2025Updated 7 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 4 months ago
- A Step-by-Step Implementation of RAPTOR-based RAG☆37Sep 1, 2025Updated 6 months ago
- Intuitive RAG system on top of LlamaIndex☆15Nov 8, 2024Updated last year
- Intelligent Help for Efficient Programming☆18Jan 11, 2024Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Creating the DeepSeek V3 model from scratch☆25Mar 28, 2025Updated 11 months ago
- Llama from scratch, or How to implement a paper without crying☆584May 29, 2024Updated last year
- ☆24Jun 12, 2024Updated last year
- This repository contains end-to-end solutions for standard machine learning problems and problem statements shared in interviews☆23Mar 25, 2023Updated 2 years ago
- Detecting Pulse from Head Motions in Video☆20Jun 22, 2022Updated 3 years ago
- BlockchainGPT: An intuitive, chat-based platform to manage your blockchain environments using natural language processing capabilities.☆11Jul 6, 2023Updated 2 years ago
- U-Transkript is a powerful Python library for automatically extracting transcripts (subtitles) from YouTube videos and translating them i…☆13Feb 16, 2026Updated 3 weeks ago
- LoRA and DoRA from Scratch Implementations☆216Mar 5, 2024Updated 2 years ago
- Some notes on reproducing LLMs from scratch☆244Apr 29, 2025Updated 10 months ago
- PredictHub is a sophisticated stock price prediction platform that combines machine learning with real-time market data analysis. The app…☆16Aug 15, 2025Updated 6 months ago
- ☆10Dec 13, 2023Updated 2 years ago
- ☆13Jan 30, 2025Updated last year
- ☆11Jul 29, 2025Updated 7 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- Building DeepSeek R1 from Scratch☆748Mar 21, 2025Updated 11 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆78Apr 4, 2025Updated 11 months ago
- Ingesting GraphRAG from microsoft into Neo4j for local visualisation. Using their Local and Global search and comparing the results in a …☆29Oct 27, 2024Updated last year
- An automated Python tool that uses LLMs and internet to automatically fix your code until it runs perfectly.☆31Jan 22, 2025Updated last year
- From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)☆790Oct 30, 2024Updated last year
- Proxies without the networking☆11Sep 15, 2023Updated 2 years ago
- RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct☆31Feb 23, 2025Updated last year
- Provides allocations and release strategies for resources used during the lifetime of a VM.☆33Mar 2, 2026Updated last week
- A JAX-like function transformation engine, but micro: microjax☆34Oct 25, 2024Updated last year
- nanogpt turned into a chat model☆81Aug 30, 2023Updated 2 years ago