xinsheng-reborn / Open-Heart-Voice
☆26Updated 3 months ago
Alternatives and similar repositories for Open-Heart-Voice
Users who are interested in Open-Heart-Voice are comparing it to the libraries listed below.
- ☆39Updated 4 months ago
- ☆367Updated 2 weeks ago
- ☆278Updated 3 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆73Updated this week
- Official Repository of "Learning what reinforcement learning can't"☆68Updated last month
- A comprehensive collection of process reward models.☆115Updated 3 weeks ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆147Updated 3 weeks ago
- Proclamation denouncing Wang Yunhe (讨贼王云鹤檄文)☆1,092Updated 3 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆848Updated 3 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆26Updated 8 months ago
- Collecting publicly available distillation datasets based on DeepSeek-R1☆23Updated 7 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆84Updated last month
- ☆18Updated 5 months ago
- A Flexible Framework for Comprehensive Multimodal Model Evaluation☆90Updated last week
- [NeurIPS 2025]⭐️ Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning.☆229Updated 3 weeks ago
- ☆414Updated 3 weeks ago
- DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)☆179Updated 2 years ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆195Updated 5 months ago
- Official implementation for "Android in the Zoo: Chain-of-Action-Thought for GUI Agents" (Findings of EMNLP 2024)☆94Updated last year
- Interleaving Reasoning: Next-Generation Reasoning Systems for AGI☆187Updated 2 weeks ago
- ☆303Updated 5 months ago
- Code for the paper "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"☆133Updated 2 months ago
- A Telegram bot to recommend arXiv papers☆286Updated 6 months ago
- Survey on Data-centric Large Language Models☆87Updated last year
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆360Updated 3 months ago
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆826Updated 5 months ago
- Reproducing R1 for Code with Reliable Rewards☆262Updated 5 months ago
- ☆909Updated last week
- Curation of resources for LLM research, screened by @tongyx361 to ensure high quality and accompanied with elaborately-written concise de…☆61Updated last year
- Code for paper: Reinforced Vision Perception with Tools☆57Updated 3 weeks ago