Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.
☆28Mar 11, 2025Updated last year
Alternatives and similar repositories for TinyDeepSeek
Users that are interested in TinyDeepSeek are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆40Jun 4, 2025Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Jan 4, 2024Updated 2 years ago
- Depth maps Super Resolution using PaddlePaddle☆24Nov 20, 2022Updated 3 years ago
- 持续追踪ChatGPT相关的技术资料和行业进展。☆11Apr 24, 2023Updated 3 years ago
- 📖收集国内外深度学习大模型API、论文、案例与学习资料,欢迎Star🌟☆31May 12, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Antigen-receptor Design Against Peptide-MHC Targets☆21Jan 9, 2026Updated 5 months ago
- Common tools for data processing☆22Dec 8, 2025Updated 6 months ago
- A minimal PyTorch implementation of BERT (Bidirectional Encoder Representations from Transformers)☆12Mar 20, 2023Updated 3 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆15Sep 28, 2024Updated last year
- ☆19Oct 30, 2025Updated 8 months ago
- LLM implementation one matrix multiplication at a time☆13Aug 8, 2024Updated last year
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Jun 13, 2026Updated 3 weeks ago
- Conversational Multimodal Emotion Recognition☆12Dec 7, 2020Updated 5 years ago
- Meta-analysis of drug target evidence in single-cell data☆17Oct 22, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆17Sep 16, 2023Updated 2 years ago
- ☆11May 27, 2021Updated 5 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆20Jan 11, 2026Updated 5 months ago
- Code for "SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics".☆52Sep 2, 2025Updated 10 months ago
- something for paper agent☆11Dec 18, 2024Updated last year
- SimKO: Simple Pass@K Policy Optimization☆31Oct 24, 2025Updated 8 months ago
- ☆19May 25, 2024Updated 2 years ago
- ☆14Oct 8, 2016Updated 9 years ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆164Feb 27, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- AlphaGenome PyTorch port☆147Jun 27, 2026Updated last week
- Groq-powered MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆12Jul 5, 2024Updated last year
- [ISBI 2024] Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation…☆16Feb 23, 2025Updated last year
- [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models☆149Oct 10, 2025Updated 8 months ago
- ☆12Jan 21, 2025Updated last year
- An interactive utility that breaks the interdependency between deep learning and coding☆11Aug 30, 2020Updated 5 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆21Jul 18, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- General neural tactic for Lean 4☆31Updated this week
- Formalisation of the Kelley-Meka bound on Roth numbers☆26Jun 18, 2026Updated 2 weeks ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆24Apr 13, 2026Updated 2 months ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated last year
- ☆170Jun 1, 2026Updated last month
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆20Jun 13, 2025Updated last year
- Multilingual Medicine: Model, Dataset, Benchmark, Code☆199Oct 15, 2024Updated last year