A collection on the recent reproduction papers and projects on DeepSeek-R1
☆31Feb 27, 2025Updated last year
Alternatives and similar repositories for awesome-deepseek-r1
Users that are interested in awesome-deepseek-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.☆18Mar 26, 2022Updated 4 years ago
- This is the code for G2MILP, a deep learning-based mixed-integer linear programming (MILP) instance generator.☆36Oct 3, 2024Updated last year
- ☆22Jan 26, 2024Updated 2 years ago
- Must-read papers on Reinforcement Learning (RL)☆54Nov 9, 2020Updated 5 years ago
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Scaling Agentic Environments Automatically.☆64Mar 26, 2026Updated 2 months ago
- This is the code for our ICLR 2025 paper, titled Computing Circuits Optimization via Model-Based Circuit Genetic Evolution.☆13May 27, 2025Updated last year
- PyTorch implementation for our ICCV 2023 paper Not Every Side Is Equal: Localization Uncertainty Estimation for Semi-Supervised 3D Object…☆13May 27, 2024Updated 2 years ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- PDEAgentBench: An automated benchmark framework for evaluating Code Agents on optimizing scientific PDE solvers.☆95Updated this week
- DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting☆18Mar 4, 2025Updated last year
- Experiments with reasoning models, training techniques, papers☆30Updated this week
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- ☆10Jul 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jun 18, 2025Updated 11 months ago
- Toolkit for VIPER benchmark☆16Aug 11, 2020Updated 5 years ago
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…☆29Jul 15, 2025Updated 10 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆19Nov 10, 2024Updated last year
- The implementation of ACL 2026 paper "Rethinking entropy interventions in rlvr: An entropy change perspective"☆17Jan 15, 2026Updated 4 months ago
- A curated paper list for "Foundation Neural Operators: A Survey on Pretraining Methods, Data Ecosystems, and Efficient Adaptation".☆37Feb 14, 2026Updated 3 months ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- ☆15Jun 3, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated last year
- ☆44Mar 6, 2026Updated 3 months ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- ☆16May 22, 2025Updated last year
- Synthetic data library used in operator learning for PDE problems that overcomes dependence on classical solvers such as finite differenc…☆18Aug 8, 2024Updated last year
- Code for "A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking"☆14May 26, 2023Updated 3 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆13Jun 14, 2023Updated 2 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- USTC computer practice code and interesting small projects☆13Apr 24, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆28Feb 26, 2024Updated 2 years ago
- USTC研究生学术报告选课脚本☆18Dec 6, 2022Updated 3 years ago
- This is the repository for EMNLP 2022 paper "Efficient Zero-shot Event Extraction with Context-Definition Alignment"☆11Dec 27, 2024Updated last year
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- ☆17Aug 1, 2025Updated 10 months ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆13Sep 21, 2022Updated 3 years ago
- ☆29Jul 16, 2024Updated last year