FareedKhan-dev / DeepSeek-R1-from-scratchLinks
A straightforward explanation of how DeepSeek R1 works
☆16Updated 11 months ago
Alternatives and similar repositories for DeepSeek-R1-from-scratch
Users that are interested in DeepSeek-R1-from-scratch are comparing it to the libraries listed below
Sorting:
- Building LLMs from scratch following the book from S. Raschka☆32Updated 10 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- SwiftLet is a lightweight Python framework for running open-source Large Language Models (LLMs) locally using safetensors☆28Updated 5 months ago
- This repo gives a start for the docker.☆36Updated 2 years ago
- Modified Beam Search with periodical restart☆12Updated last year
- Implementation of Liquid Nets in Pytorch☆69Updated 2 weeks ago
- Understanding Large Language Transformer Architecture like a child☆28Updated last year
- ☆15Updated last year
- Reinforcement learning framework.☆16Updated 6 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 11 months ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- ☆45Updated 8 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Updated 9 months ago
- ☆46Updated 10 months ago
- ☆11Updated 2 years ago
- Creating the DeepSeek V3 model from scratch☆24Updated 10 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆197Updated last year
- Simple GRPO scripts and configurations.☆59Updated 11 months ago
- ☆13Updated 2 years ago
- This course aims to teach from the basics of RL to advanced algorithms such as PPO.☆32Updated this week
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated last year
- code for training and using chess embeddings models☆13Updated last year
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Updated 6 months ago
- Your friendly investment advisor has now turned into an LLM chatbot!☆14Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- Encountering 14 different Naive RAG fails and using KG to solve it☆20Updated last month
- Finetune any model on HF in less than 30 seconds☆56Updated 2 weeks ago