Official code repo for our work "Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models"
☆53 · Jun 17, 2025 · Updated 11 months ago
Alternatives and similar repositories for NativeRes-LLaVA
Users that are interested in NativeRes-LLaVA are comparing it to the libraries listed below.
- [CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? ☆143 · Jul 24, 2025 · Updated 10 months ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 · Oct 14, 2024 · Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding ☆22 · Feb 26, 2025 · Updated last year
- (ICLR 2026 🔥) Code for "The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs" ☆79 · Feb 9, 2026 · Updated 4 months ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding] ☆16 · Jul 15, 2025 · Updated 10 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning ☆41 · Nov 11, 2025 · Updated 7 months ago
- A Framework for Collaboration of Experts from Benchmark ☆13 · Apr 27, 2025 · Updated last year
- ☆177 · Apr 15, 2025 · Updated last year
- MLCD-Seg is a zero-shot segmentation model from DeepGlint. ☆18 · Jul 4, 2025 · Updated 11 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation ☆33 · Dec 22, 2025 · Updated 5 months ago
- A minimal, educational HEVC (H.265) encoder written in Python. ☆53 · Feb 23, 2026 · Updated 3 months ago
- Data Efficacy for Language Model Training