FareedKhan-dev / train-tiny-llmLinks

Train a 29M parameter GPT from Scratch

☆20

Alternatives and similar repositories for train-tiny-llm

Users that are interested in train-tiny-llm are comparing it to the libraries listed below

Sorting:

FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python
☆69Updated 3 months ago
hesamsheikh / llm-mechanics
Coding an LLM and its building blocks from scratch.
☆46Updated 3 months ago
peremartra / optipfair
Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.
☆13Updated this week
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆34Updated 2 months ago
FareedKhan-dev / ai-debugger
An automated Python tool that uses LLMs and internet to automatically fix your code until it runs perfectly.
☆26Updated 6 months ago
FareedKhan-dev / gemini-AI-copilot
Intelligent Help for Efficient Programming
☆18Updated last year
ALucek / LLM-distillation-guide
☆21Updated 11 months ago
AI-Maker-Space / Build-Your-Own-RAG-System
This repository contains a toy implementation of a basic RAQA system.
☆20Updated last year
colabre2020 / LSTM-BERT-stock-predictor
☆85Updated 2 months ago
FareedKhan-dev / create-million-parameter-llm-from-scratch
Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.
☆180Updated last year
jeremyarancio / VLM-Batch-Deployment
Batch Deployment for Document Parsing with AWS Batch & Qwen-2.5-VL
☆47Updated 2 months ago
FareedKhan-dev / train-llama4
Building LLaMA 4 MoE from Scratch
☆57Updated 3 months ago
hesamsheikh / AI-Researcher-Agent
☆19Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆172Updated 11 months ago
jayita13 / GenerativeAI
GenAI Experimentation
☆57Updated last week
lucifertrj / Lats-Agent-RecSys
Build a Recommendation System Agent using LATS Agent Approach
☆32Updated 4 months ago
tonykipkemboi / crewai-mcp-demo
Repository for CrewAI MCP demo codebase
☆24Updated last week
JohnMachado11 / Build-a-Large-Language-Model-from-Scratch
Building a GPT-like LLM from scratch with PyTorch.
☆267Updated 7 months ago
rsrohan99 / llamaindex-trip-planner
AI tour planner agent using LlamaIndex Workflow
☆46Updated 6 months ago
aymenfurter / smartrag
Deep Research through Multi-Agents, using GraphRAG
☆76Updated 8 months ago
plaban1981 / Crewai-MCP
Build an MCP agent using Crewai
☆30Updated last month
mlabonne / how-to-data-science
Scripts, notebooks, and articles about data science in general.
☆47Updated 2 years ago
keitazoumana / LLMs
Repository for my LLM notebooks
☆28Updated 11 months ago
peremartra / FinLLMOpt
Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.
☆50Updated 3 weeks ago
madhukarkumar / agentic-rag
An Agentic RAG starter that use Swarm, Nemo Guardrails and SingleStore as a database
☆24Updated 7 months ago
Paulescu / plot-generator-agent
Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️
☆48Updated last year
apatti / AIEBootcamp
AI Engineering bootcamp
☆93Updated 4 months ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆259Updated last week
InnovatingAI / AutoMind
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
☆48Updated this week
FareedKhan-dev / text2video-from-scratch
A Straightforward, Step-by-Step Implementation of a Video Diffusion Model
☆50Updated 2 months ago