A straightforward explanation of how DeepSeek R1 works
☆18Feb 7, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-from-scratch
Users that are interested in DeepSeek-R1-from-scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆82Aug 18, 2025Updated 9 months ago
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆36Jul 6, 2025Updated 10 months ago
- A curated collection of prompts for Grok Imagine by xAI☆29Oct 19, 2025Updated 7 months ago
- ☆14Jan 30, 2025Updated last year
- A practical demo using Atomic Agents, showing how to build your own code generation agent that actually executes the code it writes in a …☆15Nov 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆14Nov 16, 2024Updated last year
- ☆15Apr 21, 2024Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆19Sep 13, 2024Updated last year
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆21Mar 26, 2026Updated last month
- Tools to easily integrate Anthropic Model Context Protocol(MCP) with Langchain☆17Feb 17, 2025Updated last year
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.☆12Mar 19, 2021Updated 5 years ago
- Examples to use Azure with LLMs for Chat☆18Jan 8, 2024Updated 2 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated 3 months ago
- Demonstration showing how to deploy Streamlit using Azure App Services☆17Oct 23, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Easily make and share gifs of your favorite YouTube moments. Built to self host with Python, AI, and Docker. Free and open source.☆17Dec 3, 2024Updated last year
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated 2 years ago
- Automated agent using LangChain and Gmail API to classify and respond to incoming emails based on their content.☆14Oct 12, 2024Updated last year
- Various examples for using Not Diamond to route model prompts.☆19Jun 17, 2025Updated 11 months ago
- A repo to accopmany my youtube video on how to build an AI receptionist with langgraph☆16Aug 23, 2024Updated last year
- Perplexity Lite using Langgraph, Tavily, and GPT-4.☆25May 1, 2024Updated 2 years ago
- ☆24Jun 12, 2024Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Apr 4, 2025Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆21Jun 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch☆20Jun 3, 2024Updated last year
- Source Code for the ICML 2020 Paper on Uncertainty & Robustness in Deep Learning☆17Aug 28, 2023Updated 2 years ago
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆31May 11, 2026Updated last week
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- Temporal matrix factorization for sparse traffic time series forecasting.☆60May 16, 2025Updated last year
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- An example showcasing how to create an agent with persistent long-term memory using Atomic Agents☆26Dec 15, 2024Updated last year
- An LLM Chatbot based on LangGraph and LangChain that dynamically retrieves and processes resumes using RAG to perform resume screening.☆28Aug 29, 2024Updated last year
- A better job search based on semantic matching☆17Nov 22, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆18Apr 25, 2021Updated 5 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 7 months ago
- AgenticSearch operates within an agentic workflow, utilizing Gemini 2.0 and an extensive tool registry to handle complex questions. By in…☆32Jan 16, 2025Updated last year
- HR Bot that ranks CVs using matching between with job descriptions and the CVs☆23Aug 20, 2023Updated 2 years ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- A collection of AI tutorials from Dr. Ashish Bamania☆34Apr 15, 2026Updated last month
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆30May 12, 2026Updated last week