REverse-Engineered Reasoning for Open-Ended Generation
☆93Sep 10, 2025Updated 5 months ago
Alternatives and similar repositories for REER_DeepWriter
Users that are interested in REER_DeepWriter are comparing it to the libraries listed below
Sorting:
- ☆31Sep 12, 2025Updated 5 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- a simple seqseq-autoencoder example of tensorflow☆13Sep 7, 2016Updated 9 years ago
- Reproducible and flexible LLM evaluations for scientific reasoning.☆26Jul 23, 2025Updated 7 months ago
- A simple implementation of ReasonGenRM.☆19Apr 21, 2025Updated 10 months ago
- [Blog 1] Recording a bug of grpo_trainer in some R1 projects☆22Feb 23, 2025Updated last year
- a survey on deep research☆47Sep 9, 2025Updated 5 months ago
- ☆25Dec 12, 2025Updated 2 months ago
- BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent☆190Dec 11, 2025Updated 2 months ago
- ☆23Dec 17, 2024Updated last year
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track)☆102Feb 20, 2025Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆48Oct 16, 2025Updated 4 months ago
- NExT-GPT: Any-to-Any Multimodal Large Language Model☆20Nov 3, 2024Updated last year
- ☆29Feb 24, 2025Updated last year
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆223Nov 27, 2025Updated 3 months ago
- SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward☆92Aug 8, 2025Updated 6 months ago
- RewardAnything: Generalizable Principle-Following Reward Models☆45Jun 11, 2025Updated 8 months ago
- PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".☆24Sep 19, 2021Updated 4 years ago
- ☆70Jun 18, 2025Updated 8 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- Supporting code for ReCEval paper☆31Sep 14, 2024Updated last year
- [NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning☆52Oct 23, 2025Updated 4 months ago
- The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"☆258Feb 4, 2026Updated last month
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 8 months ago
- instruction-following benchmark for large reasoning models☆44Aug 9, 2025Updated 6 months ago
- ☆33Oct 31, 2024Updated last year
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆62Oct 24, 2025Updated 4 months ago
- hwpxlib 패키지 python에서 쉽게 사용 할수 있게 만든 github repo 입니다.☆36Mar 29, 2025Updated 11 months ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆183Jun 5, 2025Updated 9 months ago
- ☆15Jan 1, 2025Updated last year
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆27Nov 1, 2025Updated 4 months ago
- ☆72Jun 10, 2025Updated 8 months ago
- Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)☆142Sep 21, 2024Updated last year
- ☆88Jun 7, 2024Updated last year
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Dec 25, 2025Updated 2 months ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- ☆22Dec 11, 2025Updated 2 months ago