☆48Oct 20, 2025Updated 4 months ago
Alternatives and similar repositories for verl-internvl
Users that are interested in verl-internvl are comparing it to the libraries listed below
- Beihang University "Fengru Cup" paper template (2022)☆13Apr 24, 2022Updated 3 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆55Mar 9, 2025Updated last year
- Multi-GPU/distributed training script in TensorFlow 1.x.☆17Nov 6, 2019Updated 6 years ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 9 months ago
- ☆14May 26, 2025Updated 9 months ago
- 🔥🔥[NeurIPS2025]Exploring and mitigating semantic hallucinations in scene text perception and reasoning☆27Dec 11, 2025Updated 3 months ago
- ☆14Jun 6, 2023Updated 2 years ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- Python package to accelerate research on generalized out-of-distribution (OOD) detection.☆15Jun 19, 2024Updated last year
- Code for the paper: "TB-Net: A Three-Stream Boundary-Aware Network for Fine-Grained Pavement Disease Segmentation"☆11Nov 10, 2020Updated 5 years ago
- [CVPR 2025 Highlight] Official repository of MapDR dataset proposed in paper "Driving by the Rules: A Benchmark for Integrating Traffic S…☆34May 28, 2025Updated 9 months ago
- ☆38Feb 3, 2026Updated last month
- AdaIFL: Adaptive Image Forgery Localization via a Dynamic and Importance-aware Transformer Network☆16Feb 11, 2025Updated last year
- Code for the paper "Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding"☆20Aug 30, 2024Updated last year
- [ICLR 2024] Towards Eliminating Hard Label Constraints in Gradient Inversion Attacks☆14Feb 6, 2024Updated 2 years ago
- BusterX and BusterX++☆37Mar 9, 2026Updated last week
- Human-centric environment representations from egocentric video☆14Feb 5, 2026Updated last month
- Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models☆18Oct 16, 2024Updated last year
- Implementation of GALS (GNSS-Augmented LiDAR SLAM)☆14Jul 5, 2022Updated 3 years ago
- [AAAI 2022] MadisNet: Inharmonious Region Localization by Magnifying Domain Discrepancy☆17Feb 24, 2026Updated 3 weeks ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Dec 3, 2023Updated 2 years ago
- Official code for ICML 2024 paper, "Connecting the Dots: Collaborative Fine-tuning for Black-Box Vision-Language Models"☆19Jun 12, 2024Updated last year
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- CVPR 2025 - R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning☆22Aug 28, 2025Updated 6 months ago
- The official repository of "So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection"☆29Oct 29, 2025Updated 4 months ago
- ☆25Mar 13, 2021Updated 5 years ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- [NeurIPS 2025 Datasets & Benchmarks Track] The Illusion of Progress? A Critical Look at Test-Time Adaptation for Vision-Language Models☆34Oct 26, 2025Updated 4 months ago
- Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs☆11Jul 17, 2019Updated 6 years ago
- official repo for `thinking with images through-self-calling`☆25Dec 28, 2025Updated 2 months ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 3 months ago
- Comprehensive benchmark for video text understanding☆28Jun 4, 2025Updated 9 months ago
- ☆20Dec 14, 2024Updated last year
- ☆12Feb 2, 2024Updated 2 years ago
- This is the official PyTorch Implementation of "SoTTA: Robust Test-Time Adaptation on Noisy Data Streams (NeurIPS '23)" by Taesik Gong*, …☆22Mar 22, 2024Updated last year
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆15Mar 22, 2023Updated 2 years ago
- [EMNLP-2025] R1-Zero on ANY TASK☆30Nov 9, 2025Updated 4 months ago
- ☆18Apr 10, 2025Updated 11 months ago