[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"
☆80 · Oct 25, 2025 · Updated 7 months ago
Alternatives and similar repositories for q-frame
Users that are interested in q-frame are comparing it to the libraries listed below.
- Boosting the Class-Incremental Learning in 3D Point Clouds via Zero-Collection-Cost Basic Shape Pre-Training · ☆13 · Nov 30, 2024 · Updated last year
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale · ☆15 · Jul 1, 2023 · Updated 2 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos · ☆28 · Jun 4, 2026 · Updated 2 weeks ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos · ☆13 · Jun 26, 2023 · Updated 2 years ago
- [CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding · ☆121 · Apr 17, 2026 · Updated 2 months ago
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection · ☆23 · Nov 14, 2024 · Updated last year
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding · ☆77 · Jun 26, 2025 · Updated 11 months ago
- ☆15 · Oct 10, 2024 · Updated last year
- ☆13 · Sep 28, 2024 · Updated last year
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation · ☆33 · Dec 22, 2025 · Updated 5 months ago
- An implementation of the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de… · ☆48 · Jul 8, 2025 · Updated 11 months ago
- ☆230 · May 26, 2026 · Updated 3 weeks ago
- ☆31 · Nov 1, 2023 · Updated 2 years ago
- ☆15 · Jul 9, 2019 · Updated 6 years ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring · ☆25 · Aug 8, 2025 · Updated 10 months ago
- [AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm… · ☆11 · Dec 5, 2025 · Updated 6 months ago
- Technical Challenge Repository for Visual Anomaly Detection Workshop (VAND) at CVPR · ☆14 · Jul 21, 2025 · Updated 10 months ago
- Inception-I3D, Non Local finetune, hmdb51_flow · ☆15 · Oct 15, 2019 · Updated 6 years ago
- SurgLaVi: Official repository · ☆36 · Mar 4, 2026 · Updated 3 months ago
- ☆14 · Sep 28, 2023 · Updated 2 years ago
- ☆10 · Aug 1, 2021 · Updated 4 years ago
- Plato is a system for viewport adaptation based bitrate adaptive VR video streaming. · ☆15 · May 1, 2018 · Updated 8 years ago
- ☆31 · Feb 27, 2025 · Updated last year
- [ICLR 2026] The official implementation of "Dichotomous Diffusion Policy Optimization" · ☆43 · May 2, 2026 · Updated last month
- An implementation of MSSRM method · ☆10 · Mar 23, 2023 · Updated 3 years ago
- ☆11 · Jan 18, 2024 · Updated 2 years ago
- ☆30 · Feb 18, 2022 · Updated 4 years ago
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection · ☆27 · May 13, 2026 · Updated last month
- [CVPR 2023] Better “CMOS” Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution · ☆10 · Mar 19, 2024 · Updated 2 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality · ☆18 · Oct 8, 2023 · Updated 2 years ago
- Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation · ☆20 · Apr 13, 2021 · Updated 5 years ago
- ☆41 · May 12, 2026 · Updated last month
- [MICCAI 2022] Toward Clinically Assisted Colorectal Polyp Recognition via Structured Cross-modal Representation Consistency · ☆13 · Nov 8, 2024 · Updated last year
- Open-source audio embedding models, submitted to the HEAR 2021 challenge · ☆11 · Feb 15, 2026 · Updated 4 months ago
- Transferring Genshin PVs into a freehand style with Diffusion Model. · ☆10 · Jun 5, 2024 · Updated 2 years ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation · ☆14 · Sep 14, 2023 · Updated 2 years ago
- Reinforcing Action Policies by Prophesying · ☆41 · Nov 26, 2025 · Updated 6 months ago
- ☆28 · Aug 2, 2023 · Updated 2 years ago
- ☆19 · Apr 10, 2025 · Updated last year