🚀enhanced GRPO with more verifiable rewards and real-time evaluators
☆37Jan 27, 2026Updated 3 months ago
Alternatives and similar repositories for R1
Users that are interested in R1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 6 months ago
- 🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT☆191Apr 17, 2023Updated 3 years ago
- [WMT 2022 champion system] Vega-MT model and inference scripts☆41Feb 10, 2023Updated 3 years ago
- Code for paper: Variance Reduced Local SGD with Lower Communication Complexity☆12May 20, 2020Updated 5 years ago
- Code for Retrieval-Augmented Perception (ICML 2025)☆69Apr 22, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding☆17Nov 10, 2025Updated 5 months ago
- ☆14Aug 18, 2022Updated 3 years ago
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Multi-Level Memory for Task Oriented Dialogs☆15Jul 19, 2019Updated 6 years ago
- 🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation☆73Mar 25, 2024Updated 2 years ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆12Feb 28, 2026Updated 2 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TeD-Q (Tensor-network enhanced Distributed Quantum) is a tensor network enhanced distributed hybrid quantum machine learning framework.☆96Mar 10, 2023Updated 3 years ago
- Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis☆17Mar 27, 2023Updated 3 years ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- ☆11Jan 3, 2024Updated 2 years ago
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E…☆31Apr 26, 2026Updated last week
- ☆20May 24, 2025Updated 11 months ago
- Spatial Aptitude Training for Multimodal Langauge Models☆31Feb 8, 2026Updated 2 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting☆21Mar 25, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆41Updated this week
- ☆12Jul 6, 2022Updated 3 years ago
- ☆14Feb 2, 2021Updated 5 years ago
- The official implement of DS2DP [TGRS 2022]☆63Feb 15, 2025Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"☆22Feb 28, 2026Updated 2 months ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated last year
- A peer-to-peer communication system. BIT 小学期软件开发实训。☆11Sep 7, 2018Updated 7 years ago
- Gender prediction of chinese name based on LSTM☆14Mar 16, 2023Updated 3 years ago
- Pytorch code for "Attention Based Real Image Restoration", IEEE Transactions on Neural Networks and Learning Systems, 2021☆18Nov 23, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Apr 13, 2026Updated 3 weeks ago
- Sketching-based matrix computations for numpy arrays☆17Oct 29, 2019Updated 6 years ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆14Mar 11, 2025Updated last year
- ☆30Mar 19, 2021Updated 5 years ago
- Code for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge☆17Dec 31, 2024Updated last year
- some my implementation of content in PPA☆18Nov 3, 2020Updated 5 years ago