liyuan24/deepseek_from_scratch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liyuan24/deepseek_from_scratch)

liyuan24 / deepseek_from_scratch

☆18

Alternatives and similar repositories for deepseek_from_scratch

Users that are interested in deepseek_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JRPan / crisp-artifact
View on GitHub
☆15Feb 5, 2025Updated last year
s-sahoo / scaling-dllms
View on GitHub
[ICML 2026] Scaling Beyond Masked Diffusion Language Models
☆31Jul 3, 2026Updated 3 weeks ago
yifu-ding / BGEMM-CUDA
View on GitHub
BGEMM-CUDA is a CUDA-based low-bit GEMM kernel library for efficient neural network inference. It implements optimized binary and ternary…
☆20Aug 30, 2024Updated last year
roymiles / VeLoRA
View on GitHub
[NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections
☆22Oct 15, 2024Updated last year
AMA-CMFAI / DARE
View on GitHub
This is the codes of "DARE: Aligning LLM Agents with the R Statistical Ecosystem via Distribution-Aware Retrieval"
☆16Mar 6, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FareedKhan-dev / AI-outlier-detection
View on GitHub
Outlier Detection with AI + ML
☆15Sep 12, 2025Updated 10 months ago
apoorvumang / knowledge-cutoff
View on GitHub
Benchmark to measure what the real knowledge cutoff of a model is
☆15Jul 10, 2026Updated 2 weeks ago
stanford-cs336 / spring2024-assignment5-alignment
View on GitHub
☆15Jun 12, 2024Updated 2 years ago
VAGOsolutions / SauerkrautLM-Doom-MultiVec
View on GitHub
A tiny 1.3M parameter model that plays DOOM, outperforming LLMs up to 92,000x its size.
☆26May 11, 2026Updated 2 months ago
amazon-science / factual-confidence-of-llms
View on GitHub
Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"
☆17Dec 4, 2024Updated last year
hkproj / multi-latent-attention
View on GitHub
☆46May 24, 2025Updated last year
adwaitjog / mafia
View on GitHub
MAFIA: Multiple Application Framework for GPU architectures
☆28Jan 21, 2022Updated 4 years ago
ibadrather / pytorch_learn
View on GitHub
Learning Pytorch
☆13Oct 31, 2023Updated 2 years ago
jaehyun1ee / standalone-ddl
View on GitHub
2019 딥러닝-비전처리 홀로서기 특강에 사용된 Lecture Note 및 Code Repository입니다.
☆12Sep 7, 2019Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
jlwhelan28 / pac-hunter
View on GitHub
Lookup donation history of a Political Action Committee to specific US federal election candidates using data sourced directly from the F…
☆14Nov 14, 2022Updated 3 years ago
Emperor-WS / PyEmber
View on GitHub
An Educational Framework Based on PyTorch for Deep Learning Education and Exploration
☆11Dec 24, 2023Updated 2 years ago
Infatoshi / rl-handbook
View on GitHub
Code companion for the RL Post-Training Handbook - training reasoning models on a single GPU
☆19Jan 30, 2026Updated 5 months ago
muellerzr / smol-moe
View on GitHub
☆25Oct 10, 2025Updated 9 months ago
helpingstar / pika-zoo
View on GitHub
🕹 Pikachu-volleyball game-based multi-agent RL environment using PettingZoo
☆11Sep 29, 2024Updated last year
yoonholee / DivDis
View on GitHub
☆38Oct 21, 2022Updated 3 years ago
stanford-cs336 / spring2024-assignment2-systems
View on GitHub
☆19May 3, 2024Updated 2 years ago
ezoerner / solutions-learn-physics-with-fp
View on GitHub
Solutions to exercises in the book *Learn Physics with Functional Programming*
☆11Mar 8, 2025Updated last year
JNYH / DataCamp_Machine_Learning_with_Tree-Based_Models
View on GitHub
This is a memo to share what I have learnt in Machine Learning with Tree-Based Models (using Python)
☆19Oct 16, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
paulveillard / cybersecurity-API-security-checklist
View on GitHub
An ongoing collection of awesome software, API libraries, checlists, best guidelines and resources and most important security countermea…
☆14Nov 15, 2022Updated 3 years ago
sionic-ai / pycon-2024-tutorial
View on GitHub
2024 PyCon Korea 튜토리얼
☆12Nov 8, 2024Updated last year
Layr-Labs / rust-kzg-bn254
View on GitHub
☆15Nov 22, 2025Updated 8 months ago
ServiceNow / drbench
View on GitHub
An enterprise deep research benchmark
☆40Apr 22, 2026Updated 3 months ago
napo / rotazionivolley
View on GitHub
rappresentazione via web degli schemi di ricezione a 3 nella pallavolo rispetto alla posizione del palleggiatore
☆15Jan 4, 2019Updated 7 years ago
camel-ai / gecko
View on GitHub
☆35Jul 8, 2026Updated 2 weeks ago
gkouros / coursera-robotics-perception-mooc
View on GitHub
Contains notes and assignment solutions for the Robotics Perception MOOC offered by coursera
☆12Jun 19, 2020Updated 6 years ago
HiddenBeginner / Deep-Reinforcement-Learnings
View on GitHub
심층강화학습 책 https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated 2 years ago
Moonsong-Labs / madara-prover-api
View on GitHub
RPC server and client to run the Stone Prover on the Madara sequencer.
☆11Oct 16, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
The-Cryptography / C
View on GitHub
All Cryptography Algorithms are implemented in C.
☆12Mar 28, 2021Updated 5 years ago
nburgessx / OxfordMBA
View on GitHub
Financial Strategy Resources
☆18May 21, 2022Updated 4 years ago
ritikraj7 / cpu-centric-agentic-ai
View on GitHub
A comprehensive benchmarking framework for evaluating and optimizing CPU-centric agentic AI systems across multiple workloads, reproducin…
☆48Feb 12, 2026Updated 5 months ago
WeihongLi-ac / Awesome-Multi-Domain-Multi-Task-Learning
View on GitHub
An up-to-date list of works on Multi-domain Multi-task learning
☆18Oct 20, 2022Updated 3 years ago
cshannonn / blackscholes_nas
View on GitHub
Can a neural network learn Black Scholes, yes...
☆10Dec 17, 2018Updated 7 years ago
code4DB / Index_EAB
View on GitHub
☆13Jul 11, 2025Updated last year
encrypted-def / my-ctf-challenges
View on GitHub
My ctf challenges, mostly cryptography
☆17Jul 13, 2025Updated last year