Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.
☆29Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for TinyDeepSeek
Users that are interested in TinyDeepSeek are comparing it to the libraries listed below
Sorting:
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆39Jun 4, 2025Updated 8 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Jan 4, 2024Updated 2 years ago
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆30Jul 9, 2025Updated 7 months ago
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 10 months ago
- [ICLR'25] ApolloMoE: Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts☆52Nov 20, 2024Updated last year
- Depth maps Super Resolution using PaddlePaddle☆24Nov 20, 2022Updated 3 years ago
- 持续追踪ChatGPT相关的技术资料和行业进展。☆11Apr 24, 2023Updated 2 years ago
- PaddleClas ShiTu Image Manager PP-ShiTu 库管理工具☆18Jan 30, 2023Updated 3 years ago
- ☆30Aug 21, 2025Updated 6 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Common tools for data processing☆22Dec 8, 2025Updated 2 months ago
- Web application for real-time object detection 🔎 using Flask 🌶, OpenCV, and YoloV3 weights. It uses the COCO Dataset 🖼.☆16Apr 19, 2021Updated 4 years ago
- ☆20Jan 6, 2023Updated 3 years ago
- 🔨🔨🔨Tool for making model training data set☆20Nov 1, 2024Updated last year
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Jan 4, 2026Updated last month
- DEYOv1.5☆29Jul 22, 2024Updated last year
- 📖收集国内外深度学习大模型API、论文、案例与学习资料,欢迎Star🌟☆31May 12, 2022Updated 3 years ago
- Our 2nd-gen LMM☆34May 22, 2024Updated last year
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- Twinkle✨: Training workbench to make your model glow.☆45Updated this week
- ☆12Jan 21, 2025Updated last year
- ✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆43Apr 10, 2025Updated 10 months ago
- Our repo containes a Efficient RGB-D features extractor to category-level and instance-level 6D pose estimation.☆14Oct 29, 2025Updated 4 months ago
- Modern normalizing flows in Python. Simple to use and easily extensible.☆12Feb 11, 2026Updated 2 weeks ago
- Azure Machine Learning - MLOps Python SDKv2☆10Jul 24, 2023Updated 2 years ago
- OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models☆29Feb 4, 2026Updated 3 weeks ago
- something for paper agent☆11Dec 18, 2024Updated last year
- [NeurIPS 2025] A multi-agent framework that leverages LLMs to simulate socio-economic systems☆45Oct 18, 2025Updated 4 months ago
- A scalable data preprocessing framework built on PySpark for LLM training☆22Dec 9, 2025Updated 2 months ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A collection of research on specialized medical LLMs for specific diseases and distinct medical specialties, organized by ICD-10 chapters…☆32Oct 10, 2025Updated 4 months ago
- [npj Digital Medicine] A multimodal multidomain multilingual medical foundation model for zero shot clinical diagnosis☆17Feb 6, 2025Updated last year
- Creating Your Divine Agent 😇☆10Jan 26, 2026Updated last month
- Python solutions to coding questions in Leetcode☆13Sep 12, 2020Updated 5 years ago
- ☆10Aug 7, 2021Updated 4 years ago
- My templates used in OI. All C++.☆11Jul 17, 2018Updated 7 years ago
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆16Aug 15, 2025Updated 6 months ago
- Color detection, Contour mapping, Detecting holes, Motion detection☆10Mar 20, 2014Updated 11 years ago