FareedKhan-dev / train-tiny-llm
Train a 29M parameter GPT from Scratch
☆20Updated 4 months ago
Alternatives and similar repositories for train-tiny-llm
Users who are interested in train-tiny-llm are comparing it to the repositories listed below.
- Implementation of a GPT-4o-like multimodal model from scratch using Python☆69Updated 3 months ago
- Coding an LLM and its building blocks from scratch.☆46Updated 3 months ago
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆13Updated this week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆34Updated 2 months ago
- An automated Python tool that uses LLMs and the internet to fix your code until it runs perfectly.☆26Updated 6 months ago
- Intelligent Help for Efficient Programming☆18Updated last year
- ☆21Updated 11 months ago
- This repository contains a toy implementation of a basic RAQA system.☆20Updated last year
- ☆85Updated 2 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆180Updated last year
- Batch Deployment for Document Parsing with AWS Batch & Qwen-2.5-VL☆47Updated 2 months ago
- Building LLaMA 4 MoE from Scratch☆57Updated 3 months ago
- ☆19Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆172Updated 11 months ago
- GenAI Experimentation☆57Updated last week
- Build a Recommendation System Agent using LATS Agent Approach☆32Updated 4 months ago
- Repository for CrewAI MCP demo codebase☆24Updated last week
- Building a GPT-like LLM from scratch with PyTorch.☆267Updated 7 months ago
- AI tour planner agent using LlamaIndex Workflow☆46Updated 6 months ago
- Deep Research through Multi-Agents, using GraphRAG☆76Updated 8 months ago
- Build an MCP agent using Crewai☆30Updated last month
- Scripts, notebooks, and articles about data science in general.☆47Updated 2 years ago
- Repository for my LLM notebooks☆28Updated 11 months ago
- Optimized Large Language Models for Financial Applications – Efficient, Scalable, and Domain-Specific AI for Finance.☆50Updated 3 weeks ago
- An Agentic RAG starter that uses Swarm, NeMo Guardrails, and SingleStore as a database☆24Updated 7 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆48Updated last year
- AI Engineering bootcamp☆93Updated 4 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆259Updated last week
- AutoMind: Adaptive Knowledgeable Agent for Automated Data Science☆48Updated this week
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆50Updated 2 months ago