enhanced GRPO with more verifiable rewards and real-time evaluators
★37 · Jan 27, 2026 · Updated 2 months ago
Alternatives and similar repositories for R1
Users who are interested in R1 are comparing it to the libraries listed below.
- [ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT ★192 · Apr 17, 2023 · Updated 3 years ago
- [WMT 2022 champion system] Vega-MT model and inference scripts ★41 · Feb 10, 2023 · Updated 3 years ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding ★16 · Nov 10, 2025 · Updated 5 months ago
- ★12 · Jul 18, 2023 · Updated 2 years ago
- auto star for repo lists ★10 · Aug 26, 2023 · Updated 2 years ago
- ★14 · Aug 18, 2022 · Updated 3 years ago
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs ★27 · Jan 15, 2025 · Updated last year
- Tensorflow code for "Hierarchical Decompositional Mixtures of Variational Autoencoders" (ICML'19) ★12 · Jun 7, 2020 · Updated 5 years ago
- BossNet: Disentangling Language and Knowledge in Task Oriented Dialogs ★17 · Dec 8, 2022 · Updated 3 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting ★17 · Nov 30, 2021 · Updated 4 years ago
- [ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation ★73 · Mar 25, 2024 · Updated 2 years ago
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning". ★13 · Apr 18, 2022 · Updated 3 years ago
- Official PyTorch implementation of CD-MOE ★12 · Mar 18, 2026 · Updated 3 weeks ago
- Randomized algorithm class at CU ★15 · Jul 8, 2025 · Updated 9 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ★35 · Jul 2, 2024 · Updated last year
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training" ★13 · Jun 7, 2023 · Updated 2 years ago
- A First Look at Conventional Commits Classification ★13 · Nov 18, 2024 · Updated last year
- The open-source Mixture of Depths code and the official implementation of the paper "Router-Tuning: A Simple and Effective Approach for E… ★31 · Mar 26, 2026 · Updated 3 weeks ago
- ★38 · Jan 25, 2026 · Updated 2 months ago
- ★20 · May 24, 2025 · Updated 10 months ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology ★10 · Nov 19, 2020 · Updated 5 years ago
- Spatial Aptitude Training for Multimodal Language Models ★27 · Feb 8, 2026 · Updated 2 months ago
- Towards Safe LLM with our simple-yet-highly-effective Intention Analysis Prompting ★21 · Mar 25, 2024 · Updated 2 years ago
- [EMNLP22] Improving Sharpness-Aware Minimization with Fisher Mask for Better Generalization on Language Models ★22 · Mar 27, 2023 · Updated 3 years ago
- ★12 · Jul 6, 2022 · Updated 3 years ago
- ★14 · Feb 2, 2021 · Updated 5 years ago
- The official implementation of DS2DP [TGRS 2022] ★63 · Feb 15, 2025 · Updated last year
- Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters" ★22 · Feb 28, 2026 · Updated last month
- Official implementation of "HLRTF: Hierarchical Low-Rank Tensor Factorization for Inverse Problems in Multi-Dimensional Imaging," CVPR 20… ★21 · Aug 6, 2022 · Updated 3 years ago
- KMean Coreset evaluation and computation. ★12 · Jun 6, 2017 · Updated 8 years ago
- MFURLN relationship detection method ★21 · May 17, 2020 · Updated 5 years ago
- Source code of COLING 2022 paper "A Contrastive Cross-channel Data Augmentation Framework for Aspect-based Sentiment Analysis" ★22 · Feb 18, 2023 · Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset. ★18 · Apr 22, 2025 · Updated 11 months ago
- Math24o: High School Olympiad Mathematics Chinese Benchmark ★11 · Mar 27, 2025 · Updated last year
- MatClaw: an open materials-science agent that turns natural-language tasks into reproducible simulation workflows. ★37 · Apr 8, 2026 · Updated last week
- Official implementations for Discourse-Aware Graph Networks for Textual Logical Reasoning (TPAMI) and DAGN: Discourse-Aware Graph Network… ★28 · Feb 19, 2026 · Updated last month
- The implementation of the paper "Harvesting and Refining Question-Answer Pairs for Unsupervised QA" ★33 · Nov 25, 2020 · Updated 5 years ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding" ★27 · Mar 13, 2026 · Updated last month
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images ★14 · Mar 11, 2025 · Updated last year