A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework.
☆15Mar 19, 2025Updated last year
Alternatives and similar repositories for Awesome-DeepSeek-R1-Reproduction
Users who are interested in Awesome-DeepSeek-R1-Reproduction are comparing it to the libraries listed below.
- Datasets and Evaluation Scripts for CompHRDoc☆58Feb 25, 2025Updated last year
- ☆107Dec 5, 2025Updated 4 months ago
- Question-Directed Graph Attention Network for Numerical Reasoning over Text☆10Aug 14, 2020Updated 5 years ago
- A structured parsing technique for NER☆15May 26, 2023Updated 2 years ago
- Implementation example of Distributed Tensorflow☆10Jul 22, 2017Updated 8 years ago
- ☆12Mar 22, 2024Updated 2 years ago
- 2016 Edition of the Free Encyclopedia of Mathematics (top-level repo)☆19Mar 8, 2016Updated 10 years ago
- ☆15Jan 23, 2025Updated last year
- ☆18Jul 25, 2025Updated 8 months ago
- ☆19Dec 6, 2024Updated last year
- ☆17Jul 4, 2025Updated 9 months ago
- [SemEval'19] Code for "HLT@SUDA at SemEval 2019 Task 1: UCCA Graph Parsing as Constituent Tree Parsing"☆18Oct 17, 2020Updated 5 years ago
- Chinese whole-word masking (wwm) and n-gram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- ☆17Oct 4, 2020Updated 5 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆20Nov 16, 2022Updated 3 years ago
- Examples of WeChat Mini Program payments, template messages, and more☆16Sep 15, 2017Updated 8 years ago
- ☆16Oct 28, 2021Updated 4 years ago
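- 🎯 Enterprise-grade AI assistant rule system (Chinese edition) - built for Chinese developers, with one-click installation and configuration for mainstream AI tools such as Augment, Cursor, Claude Code, and Trae AI☆28Aug 1, 2025Updated 8 months ago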
- Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking☆13Feb 5, 2023Updated 3 years ago
- ☆23Jul 2, 2025Updated 9 months ago
- ☆21Mar 19, 2024Updated 2 years ago
- ☆27Jul 13, 2023Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- Everything about AI4Gemetry (AI for geometry problem solving / theorem proving).☆18Dec 19, 2025Updated 4 months ago
- mask2former psg☆22Dec 12, 2022Updated 3 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Jan 22, 2024Updated 2 years ago
- This is an accurate implementation for IoU loss between two rotated polygons. This algorithm is accurate and differential, but there is n…☆18Mar 5, 2022Updated 4 years ago
- Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"☆20Oct 26, 2023Updated 2 years ago
- ☆28Sep 9, 2025Updated 7 months ago
- Locally corrected Nyström (LCN), as proposed in "Scalable Optimal Transport in High Dimensions for Graph Distances, Embedding Alignment, …☆19Apr 26, 2023Updated 2 years ago
- We design models that generate conversational responses for factual questions using expert answer phrases from Question Answering systems…☆21Jul 2, 2020Updated 5 years ago
- Code for EMNLP22 SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation.☆23Jun 22, 2023Updated 2 years ago
- rtsp server, H264/H265/AAC/PCMA(G711A); TCP/UDP; support authentication☆75Oct 20, 2025Updated 5 months ago
- First steps with ORTC☆10Feb 3, 2019Updated 7 years ago
- Code for the EMNLP 2020 paper "Re-examining the Role of Schema Linking in Text-to-SQL".☆28Nov 23, 2020Updated 5 years ago
- ☆26Feb 3, 2023Updated 3 years ago
- [NAACL 2025 Findings] Code for "Perception Compressor: A Training-Free Prompt Compression Framework in Long Context Scenarios"☆25Mar 5, 2025Updated last year
- C++ client side library for building mediasoup based applications.☆64May 9, 2025Updated 11 months ago
- Artifact associated with the paper "Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking"☆25May 4, 2020Updated 5 years ago