Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆210 · May 12, 2024 · Updated 2 years ago
Alternatives and similar repositories for create-million-parameter-llm-from-scratch
Users who are interested in create-million-parameter-llm-from-scratch are comparing it to the libraries listed below.
- Understanding Large Language Transformer Architecture like a child ☆34 · Apr 3, 2024 · Updated 2 years ago
- Building a multi-agent RAG system with advanced RAG methods ☆13 · Jan 12, 2025 · Updated last year
- Train a 29M parameter GPT from Scratch ☆45 · Mar 4, 2025 · Updated last year
- We have listed some free and powerful GenAI APIs and explored their benefits and usage. ☆16 · Feb 3, 2024 · Updated 2 years ago
- From-scratch Llama 2-inspired Transformer in PyTorch for exploring tokenization, pre-training, and inference. ☆39 · Jun 12, 2026 · Updated 2 weeks ago
- Notes and code for Programming Massively Parallel Processors ☆13 · Mar 29, 2025 · Updated last year
- A straightforward method for training your LLM, from downloading data to generating text. ☆7,841 · Jun 24, 2026 · Updated last week
- 100 Days of GPU Challenge ☆26 · Nov 15, 2025 · Updated 7 months ago
- Micro Llama is a small Llama-based model with 300M parameters trained from scratch with a $500 budget ☆168 · Aug 11, 2025 · Updated 10 months ago
- ☆30 · Jun 20, 2024 · Updated 2 years ago
- Composition of Multimodal Language Models From Scratch ☆15 · Aug 16, 2024 · Updated last year
- Llama from scratch, or How to implement a paper without crying ☆580 · May 29, 2024 · Updated 2 years ago
- ☆12 · Feb 16, 2026 · Updated 4 months ago
- Notebooks from YouTube videos ☆18 · Dec 27, 2021 · Updated 4 years ago
- Intelligent Help for Efficient Programming ☆18 · Jan 11, 2024 · Updated 2 years ago
- ☆24 · Jun 12, 2024 · Updated 2 years ago
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners" ☆22 · Jun 29, 2024 · Updated 2 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆16 · Apr 30, 2025 · Updated last year
- gantt-view-js extends jQuery ☆12 · Aug 16, 2017 · Updated 8 years ago
- A full-stack web chatbot application integrated with Ollama ☆12 · Jul 31, 2024 · Updated last year
- An LLM-powered advanced RAG pipeline built from scratch ☆859 · Jan 26, 2024 · Updated 2 years ago
- ☆13 · May 10, 2022 · Updated 4 years ago
- A step-by-step implementation of RAPTOR-based RAG ☆42 · Sep 1, 2025 · Updated 10 months ago
- LLM as World Models using Bayesian inference ☆21 · May 27, 2025 · Updated last year
- An app to organize your research: A Paper Based Approach ☆22 · Feb 26, 2023 · Updated 3 years ago
- Implementation of 12 AI agents evaluation techniques ☆43 · Jul 31, 2025 · Updated 11 months ago
- Creating the DeepSeek V3 model from scratch ☆28 · Mar 28, 2025 · Updated last year
- ☆13 · Mar 1, 2025 · Updated last year
- Microservice for user authentication, authorization based on JWT mechanism with role-based access control. Project implement Event Driven… ☆29 · May 15, 2025 · Updated last year
- Intuitive RAG system on top of LlamaIndex ☆15 · Nov 8, 2024 · Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step ☆98,270 · Jun 2, 2026 · Updated 3 weeks ago
- A chatbot based on an OpenAI GPT model that can search Google. Made with LangChain and a Gradio UI ☆26 · Apr 14, 2023 · Updated 3 years ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca. ☆35 · Dec 16, 2025 · Updated 6 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies. ☆24 · Nov 26, 2025 · Updated 7 months ago
- Notes on reproducing LLM-related work from scratch ☆253 · Apr 29, 2025 · Updated last year
- A quick-start guide to prompt engineering ☆30 · Aug 30, 2024 · Updated last year
- A curated collection of prompts for Grok Imagine by xAI ☆32 · Jun 6, 2026 · Updated 3 weeks ago
- Created and enhanced a local LLM training system on Apple Silicon with MLX and Metal API, overcoming the absence of CUDA support. Fine-tu… ☆28 · May 29, 2024 · Updated 2 years ago
- LLM query engine to retrieve augmented responses from json files. ☆15 · Oct 12, 2023 · Updated 2 years ago