A straightforward explanation of how DeepSeek R1 works
☆18Feb 7, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-from-scratch
Users that are interested in DeepSeek-R1-from-scratch are comparing it to the libraries listed below.
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries☆37Jul 6, 2025Updated 11 months ago
- ☆12Dec 14, 2024Updated last year
- All the content of my YouTube channel: https://youtube.com/@florenzerstling?si=7t10PBr6MDha74PO☆14May 28, 2025Updated last year
- ☆14Nov 16, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- Understanding Large Language Transformer Architecture like a child☆34Apr 3, 2024Updated 2 years ago
- PoC for visualizing Graphs with React, D3 and FastAPI☆20Aug 27, 2024Updated last year
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- ☆15Apr 21, 2024Updated 2 years ago
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆209Aug 23, 2024Updated last year
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆22Mar 26, 2026Updated 3 months ago
- Tools to easily integrate Anthropic Model Context Protocol(MCP) with Langchain☆17Feb 17, 2025Updated last year
- Implementation of various data science techniques and research papers☆31Dec 15, 2024Updated last year
- Dedicated Colab notebooks for experimenting with OCR models (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more) on the free-tier T4 GPU☆25Feb 12, 2026Updated 4 months ago
- Demonstration showing how to deploy Streamlit using Azure App Services☆17Oct 23, 2023Updated 2 years ago
- Easily make and share gifs of your favorite YouTube moments. Built to self host with Python, AI, and Docker. Free and open source.☆17Dec 3, 2024Updated last year
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated 2 years ago
- Automated agent using LangChain and Gmail API to classify and respond to incoming emails based on their content.☆14Oct 12, 2024Updated last year
- Various examples for using Not Diamond to route model prompts.☆19Jun 17, 2025Updated last year
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- ☆20Feb 2, 2025Updated last year
- Perplexity Lite using Langgraph, Tavily, and GPT-4.☆25May 1, 2024Updated 2 years ago
- ☆24Jun 12, 2024Updated 2 years ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Apr 4, 2025Updated last year
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners"☆22Jun 29, 2024Updated 2 years ago
- Source Code for the ICML 2020 Paper on Uncertainty & Robustness in Deep Learning☆17Aug 28, 2023Updated 2 years ago
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- An example showcasing how to create an agent with persistent long-term memory using Atomic Agents☆26Dec 15, 2024Updated last year
- Repository for CrewAI MCP demo codebase☆37Jul 17, 2025Updated 11 months ago
- A better job search based on semantic matching☆17Nov 22, 2024Updated last year
- Jax/Flax implementation of Denoising Diffusion Implicit Models☆20Jul 18, 2022Updated 3 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 8 months ago
- Flow control nodes for comfyUI, allowing for more diverse workflows☆13Apr 3, 2025Updated last year
- Ready to use whisper.cpp models implementation for iOS and Android☆25Sep 4, 2023Updated 2 years ago
- An open-source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆31Jun 22, 2026Updated last week
- Demo implementation of a reactive multi agent bot that can answer questions based on relational database information to demonstrate diffe…☆32Mar 4, 2025Updated last year
- Conformer RNN-Transducer☆14May 25, 2022Updated 4 years ago
- An LLM agent built with Langchain/Langgraph that helps analyze a CV, look up relevant jobs via API, and write a matching cover letter☆61May 1, 2024Updated 2 years ago
- Secure Dynamic Session Agent for AI generated code execution built using OpenAI, Langchain and Azure Container Apps☆26Nov 27, 2024Updated last year