Mayankpratapsingh022 / DeepSeek-from-ScratchLinks
☆57Updated 4 months ago
Alternatives and similar repositories for DeepSeek-from-Scratch
Users that are interested in DeepSeek-from-Scratch are comparing it to the libraries listed below
Sorting:
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆83Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 4 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆268Updated last week
- Learn the building blocks of how to build gpt-oss from scratch☆105Updated 2 months ago
- Examples, end-2-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK☆309Updated last week
- Repository of implementations of classic and sota rl algorithms from scratch in PyTorch☆211Updated 2 weeks ago
- Collection of impressive LLM apps with a focus on the financial sector☆141Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning☆78Updated 2 months ago
- An end-to-end Data Scientist☆197Updated last week
- ☆25Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆36Updated 6 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆449Updated 3 months ago
- dLLM: Simple Diffusion Language Modeling☆1,022Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- ☆86Updated last year
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆205Updated 3 months ago
- ☆46Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆255Updated last month
- Simple examples using Argilla tools to build AI☆56Updated last year
- Coding an LLM and its building blocks from scratch.☆101Updated 8 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆166Updated 3 months ago
- Context Engineering Course with DSPy☆202Updated 4 months ago
- ☆98Updated 8 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆479Updated 3 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆390Updated 2 weeks ago
- ☆182Updated 9 months ago
- ☆300Updated 3 months ago
- API Server for Transformer Lab☆80Updated last week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆73Updated 7 months ago
- Train LLM on Hugging Face infra☆67Updated 2 weeks ago