Train a 29M parameter GPT from Scratch
☆35Mar 4, 2025Updated last year
Alternatives and similar repositories for train-tiny-llm
Users who are interested in train-tiny-llm are comparing it to the libraries listed below.
- An automated Python tool that uses LLMs and the internet to fix your code until it runs.☆31Jan 22, 2025Updated last year
- Deep research agentic system using Test-Time Diffusion☆45Dec 11, 2025Updated 3 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python☆78Apr 4, 2025Updated 11 months ago
- Python Essentials for AWS Cloud Developers, published by Packt.☆10Apr 27, 2023Updated 2 years ago
- ☆13Jan 30, 2025Updated last year
- Intelligent Help for Efficient Programming☆18Jan 11, 2024Updated 2 years ago
- Repository for CrewAI MCP demo codebase☆35Jul 17, 2025Updated 8 months ago
- Code for the article series on building a Python compiler and interpreter☆11Feb 13, 2025Updated last year
- (NeurIPS 2025 🔥) Official implementation for "Efficient Multi-modal Large Language Models via Progressive Consistency Distillation"☆46Feb 11, 2026Updated last month
- Create a RAG AI Agent with PydanticAI and OpenAI☆15Jan 11, 2025Updated last year
- This project provides an AI-driven test case generator using FastAPI. The application accepts a GitHub repository name and generates test…☆19Jun 7, 2024Updated last year
- An MCP server that standardizes and contextualizes industrial Modbus data.☆20May 12, 2025Updated 10 months ago
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- Paper introducing jax-cosmo☆13Apr 27, 2023Updated 2 years ago
- Dashboard built with Next.js, with kanban and calendar apps.☆13Nov 3, 2022Updated 3 years ago
- All the content of my YouTube channel: https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated 9 months ago
- ☆14Nov 16, 2024Updated last year
- TensorRT depth-anything for anyone and anywhere☆15Jan 29, 2024Updated 2 years ago
- A simple neural network written in C++17 with the Eigen library, supporting both forward and backward propagation.☆10Jul 27, 2024Updated last year
- The open-access code of an interpretable machine learning-based method for room temperature prediction in a non-domestic building.☆16Aug 3, 2024Updated last year
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆201May 12, 2024Updated last year
- A simple and efficient memory pool implemented in C++11.☆10Jun 2, 2022Updated 3 years ago
- ☆11Sep 25, 2022Updated 3 years ago
- Implementation of 12 AI agents evaluation techniques☆37Jul 31, 2025Updated 7 months ago
- FlawlessChips is a C# library that provides gate-level simulation of various 8-bit chips.☆10Mar 15, 2026Updated last week
- Mixture of Experts from scratch☆13Apr 12, 2024Updated last year
- ☆15Apr 21, 2024Updated last year
- Database kernel notes☆13Aug 18, 2022Updated 3 years ago
- Tutorials related to Towards Data Science☆19Oct 18, 2025Updated 5 months ago
- GEMV implementation with CUTLASS☆19Aug 21, 2025Updated 7 months ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆14Dec 16, 2024Updated last year
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- ☆19Jul 21, 2025Updated 8 months ago
- AI Demo project: a collection of hands-on examples for developers who want to learn and explore artificial intelligence (AI) technology.☆25Jan 3, 2026Updated 2 months ago
- Tools to easily integrate Anthropic's Model Context Protocol (MCP) with LangChain☆17Feb 17, 2025Updated last year
- gpt from 0 -> 1☆11Oct 9, 2025Updated 5 months ago
- Examples to use Azure with LLMs for Chat☆16Jan 8, 2024Updated 2 years ago
- Dedicated Colab notebooks for experimenting with OCR models (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B, and more) on the free-tier T4 GPU☆23Feb 12, 2026Updated last month
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year