A collection on the recent reproduction papers and projects on DeepSeek-R1
☆32Feb 27, 2025Updated last year
Alternatives and similar repositories for awesome-deepseek-r1
Users that are interested in awesome-deepseek-r1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A novel template-free retrosynthesizer that can generate diverse sets of reactants for a desired product via discrete conditional variati…☆15Aug 7, 2022Updated 3 years ago
- The code of paper Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic. Zhihai Wang, Jie Wang*, Qi Zhou, Bin…☆21May 26, 2022Updated 3 years ago
- The code of paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.☆18Mar 26, 2022Updated 3 years ago
- This is the source code of our ICML25 paper, titled "Accelerating Large Language Model Reasoning via Speculative Search".☆23Jun 1, 2025Updated 9 months ago
- The code of paper "Learning Rule-Induced Subgraph Representations for Inductive Relation Prediction" in NeurIPS 2023.☆14Nov 25, 2023Updated 2 years ago
- Official Implement for Paper "Neural Krylov Iteration for Accelerating Linear System Solving"☆13May 6, 2025Updated 10 months ago
- Must-read papers on Reinforcement Learning (RL)☆54Nov 9, 2020Updated 5 years ago
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 10 months ago
- Testing Theory of Mind (ToM) in language models with epistemic logic☆22Dec 13, 2023Updated 2 years ago
- 针对最经典的表格型Q learning算法进行了复现,能够支持gym中大多数的离散动作和状态空间的环境,譬如CliffWalking-v0。☆10Jan 2, 2021Updated 5 years ago
- (NeurIPS 2024 Spotlight) TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment☆29Sep 27, 2024Updated last year
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- Python 高级编程☆15Dec 18, 2019Updated 6 years ago
- ☆11Oct 14, 2019Updated 6 years ago
- Bombing AI agents☆12Jun 21, 2018Updated 7 years ago
- ☆19Dec 24, 2024Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- ☆10Jul 13, 2024Updated last year
- ☆27Jan 15, 2026Updated 2 months ago
- ☆18Jun 18, 2025Updated 9 months ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆10Apr 26, 2024Updated last year
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…☆28Jul 15, 2025Updated 8 months ago
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- ☆18Nov 10, 2024Updated last year
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021))☆13Nov 21, 2021Updated 4 years ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆18Apr 15, 2025Updated 11 months ago
- ☆15Jun 3, 2019Updated 6 years ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Must-read papers on Knowledge Graph Reasoning (KGR)☆21Mar 16, 2020Updated 6 years ago
- ☆33Mar 6, 2026Updated 2 weeks ago
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated 10 months ago
- Scaling Agentic Environments Automatically.☆54Jan 22, 2026Updated 2 months ago
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆13Jun 14, 2023Updated 2 years ago
- ☆18Jul 25, 2025Updated 7 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆24Sep 26, 2024Updated last year
- USTC研究生学术报告选课脚本☆18Dec 6, 2022Updated 3 years ago
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆13Sep 21, 2022Updated 3 years ago
- ☆28Jul 16, 2024Updated last year