sii-research / OpenMOSSLinks
OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.
☆25Updated 3 months ago
Alternatives and similar repositories for OpenMOSS
Users that are interested in OpenMOSS are comparing it to the libraries listed below
Sorting:
- llm & rl☆240Updated 2 weeks ago
- ☆205Updated last week
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆60Updated 7 months ago
- In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning. Specifically, we construct a…☆61Updated 7 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆207Updated 6 months ago
- A Comprehensive Survey on Long Context Language Modeling☆199Updated 4 months ago
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆69Updated last year
- Extrapolating RLVR to General Domains without Verifiers☆177Updated 2 months ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆69Updated 7 months ago
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆490Updated 3 weeks ago
- Latest Advances on Long Chain-of-Thought Reasoning☆542Updated 3 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆276Updated 6 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆384Updated this week
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆188Updated last week
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆312Updated 3 weeks ago
- [ACL 2025] "World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning." https://arxiv.org/abs/2503.1…☆13Updated 3 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆360Updated last month
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆85Updated 5 months ago
- The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.☆364Updated last month
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆264Updated last week
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆139Updated 2 weeks ago
- Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning☆103Updated 3 weeks ago
- Reinforcement Learning in LLM and NLP.☆61Updated last month
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…☆57Updated 5 months ago
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆220Updated 3 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆254Updated 2 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆186Updated 4 months ago
- A repository sharing the literatures about large language models☆103Updated 4 months ago
- ☆172Updated 6 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆138Updated 4 months ago