Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆204May 12, 2024Updated last year
Alternatives and similar repositories for create-million-parameter-llm-from-scratch
Users that are interested in create-million-parameter-llm-from-scratch are comparing it to the libraries listed below.
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆207Aug 23, 2024Updated last year
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆549Aug 3, 2025Updated 8 months ago
- Train a 29M parameter GPT from Scratch☆35Mar 4, 2025Updated last year
- We have listed some of the free and powerful GenAI APIs and explored their benefits and usage.☆15Feb 3, 2024Updated 2 years ago
- Implements an LLM similar to Meta's Llama 2 from the ground up in PyTorch, for educational purposes.☆38Feb 7, 2025Updated last year
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated last year
- A PyTorch implementation of the Transformer: Attention Is All You Need☆14Jun 7, 2024Updated last year
- 100 Days of GPU Challenge☆27Nov 15, 2025Updated 5 months ago
- A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 7 months ago
- This repo walks you through how to use transfer learning to fine-tune an LLM (large language model) using UK Supreme Court case law as the…☆44Aug 8, 2023Updated 2 years ago
- Micro Llama is a small Llama-based model with 300M parameters, trained from scratch on a $500 budget☆167Aug 11, 2025Updated 8 months ago
- Gemini, as capable as GPT-4, provides a free API with limited access. I tested it with the help of prompt engineering and found that it c…☆36Jan 19, 2024Updated 2 years ago
- Implementation of various data science techniques and research papers☆31Dec 15, 2024Updated last year
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Llama from scratch, or How to implement a paper without crying☆581May 29, 2024Updated last year
- ☆12Feb 16, 2026Updated 2 months ago
- Notebooks from YouTube videos☆19Dec 27, 2021Updated 4 years ago
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associated blog post is here: https:…☆13May 15, 2024Updated last year
- gantt-view-js extends jQuery☆12Aug 16, 2017Updated 8 years ago
- ☆28Jun 16, 2023Updated 2 years ago
- A full-stack web chatbot application integrated with Ollama☆12Jul 31, 2024Updated last year
- An LLM-powered advanced RAG pipeline built from scratch☆857Jan 26, 2024Updated 2 years ago
- A step-by-step implementation of RAPTOR-based RAG☆38Sep 1, 2025Updated 7 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆347Dec 20, 2024Updated last year
- Implementation of 12 AI agents evaluation techniques☆39Jul 31, 2025Updated 8 months ago
- Creating the DeepSeek V3 model from scratch☆27Mar 28, 2025Updated last year
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆90,803Apr 11, 2026Updated last week
- A thin veneer of F#ness around several different frameworks to make a lightweight MVC framework.☆17Sep 5, 2011Updated 14 years ago
- Microservice for user authentication and authorization based on the JWT mechanism, with role-based access control. The project implements Event Driven…☆28May 15, 2025Updated 11 months ago
- Intuitive RAG system on top of LlamaIndex☆15Nov 8, 2024Updated last year
- Welcome to the Background Remover project! This tool allows you to effortlessly replace backgrounds in images and videos, making it perfe…☆11Feb 3, 2024Updated 2 years ago
- A chatbot based on an OpenAI GPT model that can search Google, made with LangChain and a Gradio UI☆26Apr 14, 2023Updated 3 years ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆35Dec 16, 2025Updated 4 months ago
- We tried to make an app with which users can book a bike, and a rider with a bike can help customers reach their destination. This…☆12Jul 14, 2021Updated 4 years ago
- Some notes on reproducing LLM-related work from scratch☆249Apr 29, 2025Updated 11 months ago
- Simple repository for training small reasoning models☆50Feb 17, 2026Updated 2 months ago
- This project contains a step-by-step guide on how to design an advanced agentic memory for your LLM-based applications.☆52Apr 28, 2025Updated 11 months ago