VizuaraAI / DeepSeek-From-ScratchLinks
Learn the building blocks of how to build DeepSeek from scratch.
☆89Updated 3 months ago
Alternatives and similar repositories for DeepSeek-From-Scratch
Users that are interested in DeepSeek-From-Scratch are comparing it to the libraries listed below
Sorting:
- ☆69Updated 5 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆75Updated 9 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆82Updated 4 months ago
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆216Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆400Updated 2 months ago
- repo of paper implementations☆20Updated 10 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆16Updated 9 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆108Updated 3 months ago
- GPU Kernels☆217Updated 8 months ago
- ☆89Updated 9 months ago
- Train a language model to chat like you using your personal conversations from WhatsApp, Telegram, Signal, or other platforms.☆241Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆276Updated 5 months ago
- ☆102Updated last year
- Building LLaMA 4 MoE from Scratch☆71Updated 8 months ago
- Code repository dedicated to experimenting and research with tiny reasoning language model☆43Updated last month
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆85Updated 3 months ago
- ☆46Updated 9 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆503Updated 5 months ago
- ☆45Updated 8 months ago
- Docs for GGUF quantization (unofficial)☆343Updated 5 months ago
- This repository contains an exhaustive coverage of a hands on approach to PyTorch along side powerful tools to accelerate model tuning an…☆215Updated 3 weeks ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆89Updated 2 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆327Updated last year
- ☆59Updated 3 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 7 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in pytorch☆30Updated 2 months ago
- ☆113Updated last month
- ☆408Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 4 months ago