🚀enhanced GRPO with more verifiable rewards and real-time evaluators
☆37Jan 27, 2026Updated last month
Alternatives and similar repositories for R1
Users that are interested in R1 are comparing it to the libraries listed below
Sorting:
- 🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT☆192Apr 17, 2023Updated 2 years ago
- Code for Retrieval-Augmented Perception (ICML 2025)☆68Aug 10, 2025Updated 6 months ago
- A First Look at Conventional Commits Classification☆12Nov 18, 2024Updated last year
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆12Mar 11, 2025Updated 11 months ago
- Math24o: 高中奥林匹克数学竞赛 测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- Gender prediction of chinese name based on LSTM☆14Mar 16, 2023Updated 2 years ago
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology☆10Nov 19, 2020Updated 5 years ago
- Implementation of CVPR2017 paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs" in Tensorflow (in progress...)☆13Jan 27, 2018Updated 8 years ago
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- The hyper-parameters tuning and black box optimization games☆13Apr 20, 2023Updated 2 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Jan 25, 2021Updated 5 years ago
- FR-TSVM☆12Nov 20, 2017Updated 8 years ago
- ☆12Jul 18, 2023Updated 2 years ago
- A peer-to-peer communication system. BIT 小学期软件开发实训。☆11Sep 7, 2018Updated 7 years ago
- Gaussian Process Classification and Regression on Apache Spark☆11Mar 29, 2021Updated 4 years ago
- ☆12Jul 6, 2022Updated 3 years ago
- Code for "Counterfactual Variable Control for Robust and Interpretable Question Answering"☆14Oct 13, 2020Updated 5 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- Based on the R1-Zero method, using rule-based rewards and GRPO on the Code Contests dataset.☆18Apr 22, 2025Updated 10 months ago
- pre-trained vision and language model summary☆12Apr 20, 2021Updated 4 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- Spatial Aptitude Training for Multimodal Langauge Models☆24Feb 8, 2026Updated 3 weeks ago
- Bayesian Black Box Hyper Parameter Optimizer☆12May 3, 2017Updated 8 years ago
- Code for paper: Variance Reduced Local SGD with Lower Communication Complexity☆12May 20, 2020Updated 5 years ago
- Feature resources of "Diagnosing the Environment Bias in Vision-and-Language Navigation"☆16May 6, 2020Updated 5 years ago
- ☆14May 4, 2024Updated last year
- Serializing molecule 3D structures☆14Nov 27, 2024Updated last year
- ☆18Mar 27, 2023Updated 2 years ago
- [ACL 2024] Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding☆15Nov 10, 2025Updated 3 months ago
- All-in-one benchmarking platform for evaluating LLM.☆15Nov 12, 2025Updated 3 months ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- Some notes and code test about Deep Learning☆15Jul 12, 2020Updated 5 years ago
- ☆20Apr 24, 2025Updated 10 months ago
- Code base for the EMNLP 2021 paper, "Multi-granularity Textual Adversarial Attack with Behavior Cloning".☆13Apr 18, 2022Updated 3 years ago
- Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis☆17Mar 27, 2023Updated 2 years ago
- 中文 NLP 语料库数据集☆20Dec 14, 2018Updated 7 years ago