[ICLR 2026] FOCUS: Efficient Keyframe Selection for Long Video Understanding
☆72Apr 23, 2026Updated 2 months ago
Alternatives and similar repositories for FOCUS
Users that are interested in FOCUS are comparing it to the libraries listed below.
- [CVPR 2026 Highlight] VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding☆122Apr 17, 2026Updated 2 months ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆37Sep 10, 2025Updated 9 months ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆50Oct 9, 2025Updated 8 months ago
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring☆24Aug 8, 2025Updated 10 months ago
- Internal utility libraries for Pkl☆17Jun 25, 2026Updated last week
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆27Apr 4, 2026Updated 2 months ago
- Baidu Qianfan Deep Research☆35Jun 8, 2026Updated 3 weeks ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆16Apr 1, 2025Updated last year
- Multilingual and Multiculture Benchmark and LLM☆42May 18, 2026Updated last month
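- Course materials for 2024~2025 at the Yanqi Lake campus of the University of Chinese Academy of Sciences (UCAS), including reinforcement learning, intelligent computing systems, pattern recognition, matrix analysis and applications, principles and algorithms of artificial intelligence, and natural language processing☆50Sep 22, 2025Updated 9 months ago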
- ☆12Apr 19, 2024Updated 2 years ago
- [CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding☆49Feb 28, 2026Updated 4 months ago
- [CVPR2022] Official Implementation of the paper 'Learning Where to Learn in Cross-View Self-Supervised Learning'☆29Oct 12, 2022Updated 3 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 9 months ago
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆26Feb 2, 2025Updated last year
- Data and code for "Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks" (ECML-PKDD 2022)☆11Jun 12, 2023Updated 3 years ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆38May 27, 2025Updated last year
- An unofficial implementation using Pytorch for "Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types". Improve the…☆18Nov 17, 2023Updated 2 years ago
- [CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression☆68Jun 8, 2026Updated 3 weeks ago
- The training codes of Jasper-Token-Compression-600M☆20Nov 19, 2025Updated 7 months ago
- ☆45Apr 28, 2026Updated 2 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆15May 26, 2025Updated last year
- ☆35Oct 23, 2025Updated 8 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 4 months ago
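- The author has finished their studies at USST and this project is no longer updated; see https://github.com/SLctfTeam/USST-Lecture-Table-Calendar instead. A Next.js-based calendar subscription service for University of Shanghai for Science and Technology course timetables; it supports importing the timetable into calendar apps and keeps it up to date.☆14Jun 23, 2025Updated last year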
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆12Sep 28, 2023Updated 2 years ago
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type.☆26May 13, 2026Updated last month
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆14Mar 6, 2025Updated last year
- [NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding☆68Feb 10, 2026Updated 4 months ago
- Self Evolving Large Multimodal Models with Continuous Rewards☆24Jun 9, 2026Updated 3 weeks ago
- ☆12Dec 29, 2021Updated 4 years ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model"☆12May 26, 2024Updated 2 years ago
- Overlooked Video Classification in Video Anomaly Detection☆19Oct 17, 2022Updated 3 years ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated 2 years ago
- JIT-compiled GPU kernels for quantum chemistry☆35Jan 30, 2026Updated 5 months ago
- ☆18Aug 15, 2024Updated last year
- Official implementation of "Long-Short Temporal Co-Teaching for Weakly Supervised Video Anomaly Detection"☆25Nov 6, 2024Updated last year
- The paper "SeqRank: Sequential Ranking of Salient Objects" was accepted at AAAI-24.☆11Jun 12, 2024Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆23May 8, 2025Updated last year