🚀 Enhanced GRPO with more verifiable rewards and real-time evaluators
☆37 · Jan 27, 2026 · Updated 4 months ago
Alternatives and similar repositories for R1
Users that are interested in R1 are comparing it to the libraries listed below.
- The official implementation of InfoRM [NeurIPS 2024]. ☆15 · Oct 25, 2025 · Updated 7 months ago
- A multi-lingual benchmark for evaluating industrial domain knowledge of LLMs. ☆153 · Updated this week
- The code for the paper "Dual Mutual Information Constraints for Discriminative Clustering" ☆23 · Aug 22, 2024 · Updated last year
- Code for Retrieval-Augmented Perception (ICML 2025) ☆71 · Apr 22, 2026 · Updated last month
- Code for LLM_Catastrophic_Forgetting via SAM. ☆11 · Jun 7, 2024 · Updated 2 years ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ☆17 · Nov 10, 2025 · Updated 7 months ago
- ☆12 · Jul 18, 2023 · Updated 2 years ago
- Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19) ☆12 · Jun 7, 2020 · Updated 6 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting ☆17 · Nov 30, 2021 · Updated 4 years ago
- Multi-Level Memory for Task Oriented Dialogs ☆15 · Jul 19, 2019 · Updated 6 years ago
- 🎁 [ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation ☆73 · Mar 25, 2024 · Updated 2 years ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks". ☆14 · Feb 28, 2026 · Updated 3 months ago
- awesome video representation learning ☆15 · Mar 22, 2021 · Updated 5 years ago
- Official PyTorch implementation of CD-MOE ☆12 · Mar 18, 2026 · Updated 2 months ago
- TeD-Q (Tensor-network enhanced Distributed Quantum) is a tensor network enhanced distributed hybrid quantum machine learning framework. ☆96 · Mar 10, 2023 · Updated 3 years ago
- Randomized algorithm class at CU ☆17 · Jul 8, 2025 · Updated 11 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆36 · Jul 2, 2024 · Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better ☆16 · Feb 15, 2025 · Updated last year
- A First Look at Conventional Commits Classification ☆15 · Nov 18, 2024 · Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E… ☆31 · May 12, 2026 · Updated last month
- ☆20 · May 24, 2025 · Updated last year
- ☆77 · Apr 17, 2026 · Updated last month
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology ☆10 · Nov 19, 2020 · Updated 5 years ago
- Spatial Aptitude Training for Multimodal Language Models ☆33 · Feb 8, 2026 · Updated 4 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting ☆21 · Mar 25, 2024 · Updated 2 years ago
- ☆46 · May 3, 2026 · Updated last month
- ☆14 · Feb 2, 2021 · Updated 5 years ago
- ☆12 · Jul 6, 2022 · Updated 3 years ago
- The official implementation of DS2DP [TGRS 2022] ☆63 · Feb 15, 2025 · Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ☆23 · Feb 28, 2026 · Updated 3 months ago
- ☆12 · Mar 15, 2024 · Updated 2 years ago
- Official implementation of "HLRTF: Hierarchical Low-Rank Tensor Factorization for Inverse Problems in Multi-Dimensional Imaging," CVPR 20… ☆21 · Aug 6, 2022 · Updated 3 years ago
- KMean Coreset evaluation and computation. ☆12 · Jun 6, 2017 · Updated 9 years ago
- MFURLN relationship detection method ☆21 · May 17, 2020 · Updated 6 years ago
- Source code of COLING 2022 paper "A Contrastive Cross-channel Data Augmentation Framework for Aspect-based Sentiment Analysis" ☆22 · Feb 18, 2023 · Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset. ☆18 · Apr 22, 2025 · Updated last year
- MatClaw: an open materials-science agent that turns natural-language tasks into reproducible simulation workflows. ☆48 · Apr 8, 2026 · Updated 2 months ago
- An "end-to-end trainable task-oriented dialogue model" implementation. ☆37 · Dec 8, 2022 · Updated 3 years ago
- A peer-to-peer communication system. BIT short-semester software development practicum. ☆11 · Sep 7, 2018 · Updated 7 years ago