Reinforcement Learning of Vision Language Models with Self Visual Perception Reward
☆170Mar 14, 2026Updated last month
Alternatives and similar repositories for Vision-SR1
Users that are interested in Vision-SR1 are comparing it to the libraries listed below.
- An easy python package to run quick basic QA evaluations. This package includes standardized QA evaluation metrics and semantic evaluatio…☆61Jul 18, 2025Updated 9 months ago
- Synthetic Video hallucination and Mitigation☆22Sep 21, 2025Updated 7 months ago
- Self-evolving vision language models from zero data☆71Mar 14, 2026Updated last month
- VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆51Apr 17, 2026Updated 2 weeks ago
- ☆24Jun 18, 2025Updated 10 months ago
- ☆16Jan 30, 2022Updated 4 years ago
- ☆12Apr 18, 2025Updated last year
- ☆20Jun 10, 2025Updated 10 months ago
- Computer-Use Agents as Judges for Generative UI☆45Nov 27, 2025Updated 5 months ago
- [AAAI 26 Demo] Official repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal P…☆65Jan 27, 2026Updated 3 months ago
- ☆32Jul 29, 2024Updated last year
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆35Apr 13, 2026Updated 3 weeks ago
- ☆45Dec 16, 2025Updated 4 months ago
- Mixture of Lora Experts☆10Apr 7, 2024Updated 2 years ago
- Official Implementation of ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO (AAAI'25)☆23Nov 25, 2025Updated 5 months ago
- [ICCV 2025] Boosting MLLM Reasoning with Text-Debiased Hint-GRPO☆47Jul 1, 2025Updated 10 months ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- [NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos☆27Apr 8, 2025Updated last year
- ☆22Sep 16, 2025Updated 7 months ago
- JetMax robotic arm in Gazebo☆15Jul 29, 2021Updated 4 years ago
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning☆47Jul 17, 2025Updated 9 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆45Apr 22, 2026Updated last week
- [CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding☆206Dec 19, 2025Updated 4 months ago
- ☆26Feb 2, 2023Updated 3 years ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 2 months ago
- Official Implementation (PyTorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…☆25Jan 26, 2025Updated last year
- [ACL Findings 2026] Official Implementation of "FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acc…☆31Apr 14, 2026Updated 3 weeks ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- ☆150Apr 8, 2026Updated 3 weeks ago
- Heterogeneous Multi-agent Version of Highway-env☆18Jun 28, 2023Updated 2 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- ☆34Sep 19, 2025Updated 7 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆64Feb 22, 2026Updated 2 months ago
- Multimodal Instruction Tuning with Conditional Mixture of LoRA (ACL 2024)☆32Aug 9, 2024Updated last year
- ☆107Jun 10, 2025Updated 10 months ago
- ☆16May 22, 2025Updated 11 months ago
- Real-time webcam demo with SmolVLM(mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM☆26Jun 12, 2025Updated 10 months ago
- Codes for Merging Large Language Models☆36Aug 7, 2024Updated last year