sii-research / OpenMOSSLinks
OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.
☆26Updated last month
Alternatives and similar repositories for OpenMOSS
Users that are interested in OpenMOSS are comparing it to the libraries listed below
Sorting:
- ☆198Updated 4 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆154Updated this week
- llm & rl☆205Updated last week
- Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning☆72Updated last week
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆56Updated 5 months ago
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆21Updated last week
- ☆202Updated last week
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆204Updated 4 months ago
- Short RL☆13Updated 3 months ago
- A Comprehensive Survey on Long Context Language Modeling☆181Updated last month
- [Preprint] On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification.☆419Updated 2 weeks ago
- Extrapolating RLVR to General Domains without Verifiers☆151Updated 3 weeks ago
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆60Updated last year
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆289Updated 3 weeks ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆244Updated 3 weeks ago
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆65Updated last year
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆288Updated last month
- ☆58Updated 5 months ago
- The official repository of the Omni-MATH benchmark.☆87Updated 8 months ago
- Paper list for Efficient Reasoning.☆642Updated this week
- ☆21Updated last month
- A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.☆209Updated last month
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆81Updated 6 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆200Updated 3 weeks ago
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆134Updated last month
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!☆69Updated 5 months ago
- The trainer for HF to record losses of different tasks and objectives.☆44Updated 5 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated 9 months ago
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆315Updated last month
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆103Updated 3 weeks ago