[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"
☆77 · Oct 25, 2025 · Updated 7 months ago
Alternatives and similar repositories for q-frame
Users that are interested in q-frame are comparing it to the libraries listed below.
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale ☆15 · Jul 1, 2023 · Updated 2 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos ☆27 · Aug 8, 2025 · Updated 9 months ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos ☆13 · Jun 26, 2023 · Updated 2 years ago
- [CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding ☆118 · Apr 17, 2026 · Updated last month
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding ☆77 · Jun 26, 2025 · Updated 11 months ago
- ☆14 · Oct 10, 2024 · Updated last year
- ☆12 · Apr 1, 2023 · Updated 3 years ago
- ☆23 · Jan 8, 2024 · Updated 2 years ago
- ☆31 · Nov 1, 2023 · Updated 2 years ago
- ☆15 · Jul 9, 2019 · Updated 6 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring ☆25 · Aug 8, 2025 · Updated 9 months ago
- ☆14 · Sep 28, 2023 · Updated 2 years ago
- ☆10 · Aug 1, 2021 · Updated 4 years ago
- OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models ☆123 · Apr 25, 2025 · Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation ☆82 · Oct 15, 2023 · Updated 2 years ago
- Plato is a system for viewport adaptation based bitrate adaptive VR video streaming. ☆15 · May 1, 2018 · Updated 8 years ago
- The Easiest Pytorch Implementation of Branching-DQN ☆12 · Feb 10, 2021 · Updated 5 years ago
- An implementation of MSSRM method ☆10 · Mar 23, 2023 · Updated 3 years ago
- ☆30 · Feb 18, 2022 · Updated 4 years ago
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection ☆27 · May 13, 2026 · Updated 2 weeks ago
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution ☆10 · Mar 19, 2024 · Updated 2 years ago
- We introduce CausalVQA, a benchmark dataset for video question answering (VQA) composed of question-answer pairs that probe models’ under… ☆60 · Aug 18, 2025 · Updated 9 months ago
- ☆41 · May 12, 2026 · Updated 2 weeks ago
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026] ☆187 · Mar 12, 2026 · Updated 2 months ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge ☆11 · Feb 15, 2026 · Updated 3 months ago
- Multi-Object Tracker for the H.264 and MPEG-4 Compressed Domain. ☆23 · Jul 6, 2023 · Updated 2 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation ☆14 · Sep 14, 2023 · Updated 2 years ago
- Reinforcing Action Policies by Prophesying ☆41 · Nov 26, 2025 · Updated 6 months ago
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction. ☆23 · Updated this week
- ☆28 · Aug 2, 2023 · Updated 2 years ago
- ☆18 · Apr 10, 2025 · Updated last year
- [ACL 2025] ⚖️ Temporally-aware MLLM for Biomedical Radiology Analysis and Report Generation. Flexible toolkit with MLLM backbone support,… ☆29 · Mar 18, 2026 · Updated 2 months ago
- [CVPR 2026 Highlight] A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens ☆117 · May 8, 2026 · Updated 3 weeks ago
- Panoramic Out-of-Distribution Segmentation ☆15 · Dec 21, 2025 · Updated 5 months ago
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model ☆101 · Jul 15, 2024 · Updated last year
- ☆18 · Oct 22, 2024 · Updated last year
- A simple command line tool to calculate WER for ASR. ☆14 · Oct 14, 2024 · Updated last year
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 · Jul 11, 2024 · Updated last year
- ☆42 · Jan 17, 2026 · Updated 4 months ago