☆25Apr 9, 2025Updated 11 months ago
Alternatives and similar repositories for Ocean-R1
Users that are interested in Ocean-R1 are comparing it to the libraries listed below
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Aug 28, 2024Updated last year
- ☆28Feb 18, 2025Updated last year
- ☆47Apr 9, 2025Updated 11 months ago
- ☆14Jul 15, 2025Updated 8 months ago
- Author implementation of "Contextualized Word Representations for Reading Comprehension" (Salant et al. 2017)☆11Jun 14, 2018Updated 7 years ago
- ☆218Feb 20, 2025Updated last year
- Code for paper "Out-of-domain detection for natural language understanding in dialog systems"☆10May 27, 2022Updated 3 years ago
- Official code for the paper "Self-Distillation for Few-Shot Image Captioning"☆16Mar 15, 2021Updated 5 years ago
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆274Oct 5, 2025Updated 5 months ago
- Implementation of paper: "A Neural Attention Model for Sentence Summarization" in Theano☆10Mar 30, 2019Updated 6 years ago
- ☆107Jun 10, 2025Updated 9 months ago
- ☆16Jul 29, 2025Updated 7 months ago
- Kaggle: OTTO competition☆24Feb 13, 2023Updated 3 years ago
- SysBench: Can Large Language Models Follow System Messages?☆39Sep 4, 2024Updated last year
- Solution to kaggle competition OTTO – Multi-Objective Recommender System: https://www.kaggle.com/competitions/otto-recommender-system☆21Feb 2, 2023Updated 3 years ago
- CFBench: A Comprehensive Constraints-Following Benchmark for LLMs☆50Aug 26, 2024Updated last year
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,380Feb 26, 2026Updated 3 weeks ago
- Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"☆125Feb 4, 2026Updated last month
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆13Aug 10, 2023Updated 2 years ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆131Jul 24, 2025Updated 7 months ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆31Mar 5, 2026Updated 2 weeks ago
- ☆20Apr 16, 2025Updated 11 months ago
- Official resources of "The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reaso…☆20Jun 13, 2025Updated 9 months ago
- LEO: A powerful Hybrid Multimodal LLM☆20Jan 18, 2025Updated last year
- [ICLR2026] This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that…☆1,036Jan 26, 2026Updated last month
- [ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large …☆24May 29, 2024Updated last year
- Formal representation and solving for Euclidean plane geometry problems.☆32Dec 19, 2025Updated 3 months ago
- ☆24Jul 21, 2016Updated 9 years ago
- CVPR2025☆21Aug 16, 2025Updated 7 months ago
- ☆28Feb 10, 2025Updated last year
- Kaggle 2024 Eedi 10th-place gold medal solution☆44Dec 28, 2024Updated last year
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated 3 weeks ago
- Neural network sequence labeling model - some sloppy modifications to the original toolkit to enable punctuation restoration in unsegment…☆10Jan 8, 2017Updated 9 years ago
- ☆10Jul 21, 2023Updated 2 years ago
- ☆54Sep 11, 2024Updated last year
- handy tools for user study☆21May 21, 2024Updated last year
- Original PyTorch implementation for AAAI 2021 Paper "Meta-Transfer Learning for Low-Resource Abstractive Summarization."☆26Jan 11, 2023Updated 3 years ago
- The repository for "MedChain: Bridging the Gap Between LLM Agents and Real-World Clinical Decision Making"☆48Oct 10, 2025Updated 5 months ago
- Finding Camouflaged Needle in a Haystack? Pornographic Products Detection via Berrypicking Tree Model☆10Jul 29, 2019Updated 6 years ago