Train a 29M parameter GPT from Scratch
☆33 · Mar 4, 2025 · Updated 11 months ago
Alternatives and similar repositories for train-tiny-llm
Users who are interested in train-tiny-llm are comparing it to the repositories listed below.
- An automated Python tool that uses LLMs and the internet to automatically fix your code until it runs perfectly. ☆31 · Jan 22, 2025 · Updated last year
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model ☆77 · Aug 18, 2025 · Updated 6 months ago
- Implementation of a GPT-4o-like Multimodal from Scratch using Python ☆78 · Apr 4, 2025 · Updated 10 months ago
- Intelligent Help for Efficient Programming ☆18 · Jan 11, 2024 · Updated 2 years ago
- Deep research agentic system using Test-Time Diffusion ☆42 · Dec 11, 2025 · Updated 2 months ago
- Repository for CrewAI MCP demo codebase ☆36 · Jul 17, 2025 · Updated 7 months ago
- Maximizing the Performance of a Simple RAG using RL ☆90 · Mar 20, 2025 · Updated 11 months ago
- Lidar Panoptic Segmentation without Bells and Whistles (IROS 2023) ☆25 · Oct 21, 2023 · Updated 2 years ago
- ☆13 · Jan 30, 2025 · Updated last year
- Examples of converting COBOL to Java using Ispirer Toolkit ☆20 · Mar 3, 2025 · Updated last year
- 2D road segmentation using lidar data during training ☆43 · Dec 21, 2023 · Updated 2 years ago
- A Snowflake SQL parser (WIP) ☆11 · May 31, 2020 · Updated 5 years ago
- Calculate allowed interactions in QED ☆10 · Nov 2, 2022 · Updated 3 years ago
- Mixture of Experts from scratch ☆13 · Apr 12, 2024 · Updated last year
- Eternium CSS Framework ☆13 · Dec 26, 2025 · Updated 2 months ago
- Keyword extraction using Scake, KeyBERT, Fine-tuning Transformer BERT-like models and ChatGPT. ☆12 · May 22, 2023 · Updated 2 years ago
- Hanime.tv stremio addon ☆17 · Feb 10, 2026 · Updated 3 weeks ago
- FlawlessChips is a C# library that provides gate-level simulation of various 8-bit chips. ☆10 · Jan 17, 2026 · Updated last month
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. ☆200 · May 12, 2024 · Updated last year
- SemBleu: A Robust Metric for AMR Parsing Evaluation ☆12 · Feb 22, 2021 · Updated 5 years ago
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization ☆12 · Jan 12, 2026 · Updated last month
- A modern, web-based IDE for creating and editing p5.js sketches with AI assistance and Model Context Protocol (MCP) integration for Claud… ☆22 · Jun 20, 2025 · Updated 8 months ago
- ☆11 · Oct 8, 2025 · Updated 4 months ago
- ☆10 · Sep 26, 2024 · Updated last year
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs" ☆14 · Dec 16, 2024 · Updated last year
- gpt from 0 -> 1 ☆11 · Oct 9, 2025 · Updated 4 months ago
- Evolving LangChain agent architectures using the Quality-Diversity (QD) algorithm. ☆16 · Aug 29, 2025 · Updated 6 months ago
- JavaScript library for positional astronomy ☆12 · Apr 27, 2023 · Updated 2 years ago
- AI agent that controls a computer ☆53 · Feb 23, 2025 · Updated last year
- 12 Weeks, 24 Lessons, AI for All! ☆10 · Aug 30, 2024 · Updated last year
- A desktop bot for fun... ☆12 · Feb 2, 2023 · Updated 3 years ago
- A self-contained network between two Raspberry Pis which copy images to each other forever. Created for "Personal Photographs, September … ☆11 · Jun 13, 2019 · Updated 6 years ago
- ☆19 · Jul 21, 2025 · Updated 7 months ago
- This is a project based on opencv-python which estimates the height of an object based upon its picture. It uses the height reference of a … ☆10 · Dec 11, 2020 · Updated 5 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- An easy auto framework ☆11 · Nov 14, 2023 · Updated 2 years ago
- C# library with very fast but not very accurate implementations of System.Math methods. ☆12 · Jun 4, 2017 · Updated 8 years ago
- A file handle-based BMP image file reader for MicroPython. ☆10 · Feb 14, 2023 · Updated 3 years ago
- A hover zoom effect to see a closer view of the image details. ☆11 · Jan 13, 2025 · Updated last year