multimodal-art-projection / COIG-P
☆36Updated last month
Alternatives and similar repositories for COIG-P
Users that are interested in COIG-P are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆36Updated 2 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆31Updated 4 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆72Updated 3 weeks ago
- The official repository of the Omni-MATH benchmark.☆82Updated 4 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆58Updated 5 months ago
- The code and data for the paper JiuZhang3.0☆44Updated 11 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆44Updated last week
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆62Updated 6 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 4 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆48Updated 10 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆27Updated 9 months ago
- ☆63Updated last week
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆96Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆48Updated this week
- ☆45Updated last month
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆95Updated last week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆39Updated 9 months ago
- On Memorization of Large Language Models in Logical Reasoning☆64Updated last month
- An Open Math Pre-trainng Dataset with 370B Tokens.☆84Updated last month
- ☆22Updated 10 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆135Updated 3 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆52Updated 5 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆43Updated 3 weeks ago
- The rule-based evaluation subset and code implementation of Omni-MATH☆21Updated 4 months ago
- This the implementation of LeCo☆31Updated 3 months ago
- ☆46Updated 11 months ago
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆46Updated 5 months ago
- A Comprehensive Survey on Long Context Language Modeling☆142Updated last month