[ICLR 2026] FOCUS: Efficient Keyframe Selection for Long Video Understanding
⭐ 70 · Apr 23, 2026 · Updated last month
Alternatives and similar repositories for FOCUS
Users that are interested in FOCUS are comparing it to the libraries listed below.
- A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for str… ⭐ 158 · May 12, 2026 · Updated last week
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight) ⭐ 23 · Aug 1, 2025 · Updated 9 months ago
- Visual Speech Recognition ⭐ 20 · Dec 24, 2024 · Updated last year
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large … ⭐ 14 · Apr 1, 2025 · Updated last year
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring ⭐ 25 · Aug 8, 2025 · Updated 9 months ago
- Internal utility libraries for Pkl ⭐ 16 · May 14, 2026 · Updated last week
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions ⭐ 12 · Dec 18, 2023 · Updated 2 years ago
- Multilingual and Multiculture Benchmark and LLM ⭐ 36 · Updated this week
- ⭐ 12 · Apr 19, 2024 · Updated 2 years ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos ⭐ 36 · May 27, 2025 · Updated 11 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ⭐ 36 · Oct 3, 2025 · Updated 7 months ago
- Data and models for Misinfo Reaction Frames paper. ⭐ 14 · Jun 9, 2024 · Updated last year
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)" ⭐ 25 · Feb 2, 2025 · Updated last year
- Rethinking the Trust Region in LLM Reinforcement Learning ⭐ 54 · Mar 2, 2026 · Updated 2 months ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ⭐ 10 · May 8, 2018 · Updated 8 years ago
- ⭐ 26 · Nov 20, 2025 · Updated 6 months ago
- ⭐ 22 · Dec 3, 2025 · Updated 5 months ago
- Data and code for "Probing Spurious Correlations in Popular Event-Based Rumor Detection Benchmarks" (ECML-PKDD 2022) ⭐ 11 · Jun 12, 2023 · Updated 2 years ago
- A self-adaptive and class-balanced approach to improve deep neural network performance in the presence of noisy labels ⭐ 18 · Jul 2, 2024 · Updated last year
- An unofficial implementation using PyTorch for "Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types". Improve the… ⭐ 18 · Nov 17, 2023 · Updated 2 years ago
- The training codes of Jasper-Token-Compression-600M ⭐ 19 · Nov 19, 2025 · Updated 6 months ago
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ⭐ 15 · May 26, 2025 · Updated 11 months ago
- ⭐ 36 · Oct 23, 2025 · Updated 7 months ago
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type. ⭐ 25 · May 13, 2026 · Updated last week
- [NeurIPS 2025] Deep Memory Backtracking for Long Video Understanding ⭐ 67 · Feb 10, 2026 · Updated 3 months ago
- [IEEE TIP] Official implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning". ⭐ 14 · Aug 30, 2024 · Updated last year
- Röttger et al. (2024): "IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance" ⭐ 16 · Mar 6, 2026 · Updated 2 months ago
- ICME'19: Removing Rain in Videos: A Large-scale Database and A Two-stream ConvLSTM Approach ⭐ 12 · Jul 4, 2022 · Updated 3 years ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ⭐ 35 · Oct 13, 2025 · Updated 7 months ago
- ⭐ 16 · Oct 21, 2025 · Updated 7 months ago
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT ⭐ 32 · Feb 6, 2026 · Updated 3 months ago
- Code for the paper "IFFNeRF: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model" ⭐ 12 · May 26, 2024 · Updated last year
- The official repository of the first version of ACE-Brain foundation model. ⭐ 76 · Mar 13, 2026 · Updated 2 months ago
- ⭐ 23 · Jun 16, 2022 · Updated 3 years ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis ⭐ 15 · May 16, 2024 · Updated 2 years ago
- A curated list of awesome resources for salient object ranking (SOR) ⭐ 17 · Sep 28, 2025 · Updated 7 months ago
- ⭐ 66 · May 7, 2026 · Updated 2 weeks ago
- The official repo of VideoAgentTrek ⭐ 50 · Oct 24, 2025 · Updated 6 months ago
- ⭐ 18 · Aug 15, 2024 · Updated last year