saketd403 / train-llm-from-scratchLinks

Train LLMs such as GPT and LLama from scratch.

☆12

Alternatives and similar repositories for train-llm-from-scratch

Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below

Sorting:

ServiceNow / agent-poirot
☆13Updated last month
g-hano / Doraemon-Agent
Just like the beloved character Doraemon who pulls out gadgets from his pocket, this agent can dynamically create, save, and utilize its …
☆16Updated 4 months ago
hkproj / multi-latent-attention
☆36Updated 2 weeks ago
KalyanKS-NLP / LLM-Survey-Papers-Collection
A category wise collection of 200+ LLM survey papers.
☆151Updated 2 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆46Updated this week
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆196Updated last month
kturung / colpali-llama-vision-rag
☆113Updated 6 months ago
muhammedessa / flutter-API-CRUD-with-login-http-only
☆7Updated 6 years ago
alexander-moore / vlm
Composition of Multimodal Language Models From Scratch
☆14Updated 9 months ago
vince-lam / awesome-agents
Compare open-source LLM agentic projects by their metrics to assess popularity and activeness.
☆12Updated 3 weeks ago
facebookresearch / wasp
Official implementation of the WASP web agent security benchmark
☆23Updated 3 weeks ago
AndrewNgo-ini / agentic_rag
A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…
☆142Updated 3 months ago
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆25Updated 2 months ago
neuml / annotateai
📝 Automatically annotate papers using LLMs
☆322Updated last month
saimeghana-y / Transformer-CUDA
☆53Updated last month
apatti / AIEBootcamp
AI Engineering bootcamp
☆90Updated 2 months ago
mozilla-ai / federated-finetuning
Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed…
☆35Updated last week
shreyaskarnik / huggingface-mcp-server
☆47Updated 2 months ago
jrzmnt / rl-vs-llm-chess
☆22Updated 8 months ago
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆32Updated 3 weeks ago
AstraBert / diRAGnosis
Diagnose the performance of your RAG🩺
☆36Updated 2 months ago
thu-coai / Backdoor-Data-Extraction
☆20Updated 2 weeks ago
ALucek / linear-adapter-embedding
Query Only Linear Adapter Training for Fine Tuned Embedding Model Query Representation
☆19Updated 8 months ago
menro / ai.txt
Proposed Standard for AI.txt
☆18Updated 2 years ago
RaglandCodes / Flutter-basic-API
Simple Flutter app to make API calls
☆10Updated 6 years ago
willccbb / trl
Train transformer language models with reinforcement learning.
☆19Updated 3 months ago
qubvel / transformers-notebooks
Inference and fine-tuning examples for vision models from 🤗 Transformers
☆148Updated last month
ariG23498 / fine-tune-paligemma
Notebooks for fine tuning pali gemma
☆107Updated last month
facebookresearch / collaborative-reasoner
Source code for the collaborative reasoner research project at Meta FAIR.
☆87Updated last month
FareedKhan-dev / train-tiny-llm
Train a 29M parameter GPT from Scratch
☆16Updated 3 months ago