Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆201May 12, 2024Updated last year
Alternatives and similar repositories for create-million-parameter-llm-from-scratch
Users that are interested in create-million-parameter-llm-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆203Aug 23, 2024Updated last year
- Understanding Large Language Transformer Architecture like a child☆28Apr 3, 2024Updated last year
- In this blog, we will build a small scale text-to-video model from scratch. We will input a text prompt, and our trained model will gener…☆227Jun 23, 2024Updated last year
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- A straightforward method for training your LLM, from downloading data to generating text.☆537Aug 3, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- We have listed some of the free and powerful GenAI APIs and explore their benefit and usage.☆15Feb 3, 2024Updated 2 years ago
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆79Aug 18, 2025Updated 7 months ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 7 months ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budget☆167Aug 11, 2025Updated 7 months ago
- ☆12Jan 24, 2025Updated last year
- ☆30Jun 20, 2024Updated last year
- NLP/LLM Mlops Pipeline to dev/train/evaluation, scalable deploy and monitoring systems.☆22Mar 15, 2024Updated 2 years ago
- Intelligent Help for Efficient Programming☆18Jan 11, 2024Updated 2 years ago
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆24Jun 12, 2024Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- This repository is a voice search demo using OpenAI Whisper, DuckDB, and the Metaphone algorithm. The associate blog post is here: https:…☆13May 15, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 11 months ago
- 🚀 A structured data pipeline project using dbt and Snowflake to transform raw data into curated datasets. This project covers data inges…☆13Mar 17, 2025Updated last year
- An app that extracts your twitter threads into a downloadable CSV file.☆13Apr 8, 2023Updated 2 years ago
- An LLM-powered advanced RAG pipeline built from scratch☆860Jan 26, 2024Updated 2 years ago
- Building a GPT-like LLM from scratch with PyTorch.☆339Dec 20, 2024Updated last year
- A Step-by-Step Implementation of RAPTOR based RAG implementation☆38Sep 1, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An app to organize your research: A Paper Based Approach☆22Feb 26, 2023Updated 3 years ago
- Implementation of 12 AI agents evaluation techniques☆39Jul 31, 2025Updated 7 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆89,206Mar 21, 2026Updated last week
- Microservice for user authentication, authorization based on JWT mechanism with role-based access control. Project implement Event Driven…☆28May 15, 2025Updated 10 months ago
- Intuitive RAG system on top of LllamaIndex☆15Nov 8, 2024Updated last year
- ☆17May 23, 2025Updated 10 months ago
- prompt提示词工程快速上手☆28Aug 30, 2024Updated last year
- Rust ports of iovisor/bcc/tools☆21Sep 6, 2022Updated 3 years ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆31May 29, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- My portfolio page☆20Jan 13, 2026Updated 2 months ago
- Meditation generation using streamlit, OpenAI GPT and Google TTS☆10Mar 17, 2025Updated last year
- A static deobfuscator for JavaScript Malware☆13May 6, 2020Updated 5 years ago
- LLM query engine to retrieve augmented responses from json files.☆15Oct 12, 2023Updated 2 years ago
- "What the teacher is, is more important than what he teaches."― Karl Menninger☆16Sep 10, 2021Updated 4 years ago
- ☆13Jan 30, 2025Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 5 months ago