saketd403 / train-llm-from-scratchLinks
Train LLMs such as GPT and LLama from scratch.
☆12Updated 3 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- ☆13Updated last month
- Just like the beloved character Doraemon who pulls out gadgets from his pocket, this agent can dynamically create, save, and utilize its …☆16Updated 4 months ago
- ☆36Updated 2 weeks ago
- A category wise collection of 200+ LLM survey papers.☆151Updated 2 months ago
- Fine tune Gemma 3 on an object detection task☆46Updated this week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆196Updated last month
- ☆113Updated 6 months ago
- ☆7Updated 6 years ago
- Composition of Multimodal Language Models From Scratch☆14Updated 9 months ago
- Compare open-source LLM agentic projects by their metrics to assess popularity and activeness.☆12Updated 3 weeks ago
- Official implementation of the WASP web agent security benchmark☆23Updated 3 weeks ago
- A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…☆142Updated 3 months ago
- OpenPipe Reinforcement Learning Experiments☆25Updated 2 months ago
- 📝 Automatically annotate papers using LLMs☆322Updated last month
- ☆53Updated last month
- AI Engineering bootcamp☆90Updated 2 months ago
- Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed…☆35Updated last week
- ☆47Updated 2 months ago
- ☆22Updated 8 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated 3 weeks ago
- Diagnose the performance of your RAG🩺☆36Updated 2 months ago
- ☆20Updated 2 weeks ago
- Query Only Linear Adapter Training for Fine Tuned Embedding Model Query Representation☆19Updated 8 months ago
- Proposed Standard for AI.txt☆18Updated 2 years ago
- Simple Flutter app to make API calls☆10Updated 6 years ago
- Train transformer language models with reinforcement learning.☆19Updated 3 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆148Updated last month
- Notebooks for fine tuning pali gemma☆107Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆87Updated last month
- Train a 29M parameter GPT from Scratch☆16Updated 3 months ago