A collection on the recent reproduction papers and projects on DeepSeek-R1
☆32Feb 27, 2025Updated last year
Alternatives and similar repositories for awesome-deepseek-r1
Users that are interested in awesome-deepseek-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.☆18Mar 26, 2022Updated 4 years ago
- The code of paper LMC: Fast Training of GNNs via Subgraph Sampling with Provable Convergence. Zhihao Shi, Xize Liang, Jie Wang. ICLR 2023…☆47Feb 15, 2023Updated 3 years ago
- Code for "Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space"☆21Mar 25, 2026Updated 2 weeks ago
- ☆22Jan 26, 2024Updated 2 years ago
- Must-read papers on Reinforcement Learning (RL)☆54Nov 9, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 11 months ago
- Must-read papers on Knowledge Graph Embedding☆29Oct 15, 2020Updated 5 years ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- Python 高级编程☆15Dec 18, 2019Updated 6 years ago
- Experiments with reasoning models, training techniques, papers☆28Updated this week
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- ☆10Jul 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆19Jun 18, 2025Updated 9 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆19Mar 10, 2026Updated last month
- A curated paper list for "Foundation Neural Operators: A Survey on Pretraining Methods, Data Ecosystems, and Efficient Adaptation".☆34Feb 14, 2026Updated last month
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆17Apr 15, 2025Updated 11 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Must-read papers on Knowledge Graph Reasoning (KGR)☆21Mar 16, 2020Updated 6 years ago
- ☆35Mar 6, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 11 months ago
- ☆16May 22, 2025Updated 10 months ago
- Scaling Agentic Environments Automatically.☆57Mar 26, 2026Updated 2 weeks ago
- Synthetic data library used in operator learning for PDE problems that overcomes dependence on classical solvers such as finite differenc…☆17Aug 8, 2024Updated last year
- Code for "A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking"☆14May 26, 2023Updated 2 years ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆13Jun 14, 2023Updated 2 years ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆25Sep 26, 2024Updated last year
- USTC computer practice code and interesting small projects☆12Apr 24, 2020Updated 5 years ago
- ☆14Jan 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- ☆17Aug 1, 2025Updated 8 months ago
- Adaptive Cut Selection in Mixed-Integer Linear Programming☆16Aug 2, 2023Updated 2 years ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆13Sep 21, 2022Updated 3 years ago
- ☆28Jul 16, 2024Updated last year
- Comprehensive Assessment of Trustworthiness in Multimodal Foundation Models☆28Mar 15, 2025Updated last year