Chain-of-Frames [CVPR 2026]
☆40 · Jul 2, 2025 · Updated 11 months ago
Alternatives and similar repositories for CoF
Users who are interested in CoF are comparing it to the libraries listed below.
- A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb… ☆12 · Aug 5, 2020 · Updated 5 years ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens ☆17 · Sep 8, 2025 · Updated 9 months ago
- ☆28 · Aug 9, 2025 · Updated 10 months ago
- ☆13 · Jun 23, 2022 · Updated 3 years ago
- [ECCV 2024] Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models ☆21 · Jul 17, 2024 · Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025] ☆33 · Jan 23, 2025 · Updated last year
- [ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models. ☆25 · Jul 25, 2024 · Updated last year
- ☆16 · Apr 16, 2025 · Updated last year
- ☆10 · Oct 27, 2023 · Updated 2 years ago
- [TPAMI'2023] Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling ☆11 · Jan 3, 2023 · Updated 3 years ago
- SpotEdit [NeurIPS 2025 W] ☆17 · Sep 24, 2025 · Updated 8 months ago
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant ☆30 · Dec 2, 2025 · Updated 6 months ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 · May 16, 2022 · Updated 4 years ago
- [NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning ☆266 · Oct 18, 2025 · Updated 7 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature. ☆39 · Feb 13, 2025 · Updated last year
- [CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space" ☆83 · May 12, 2026 · Updated last month
- EMMA [TMLR 2025] ☆14 · Sep 25, 2025 · Updated 8 months ago
- Visual Representation Learning with Stochastic Frame Prediction (ICML 2024) ☆27 · Nov 27, 2024 · Updated last year
- OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight] ☆67 · Sep 18, 2025 · Updated 8 months ago
- ☆39 · Nov 8, 2024 · Updated last year
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]" ☆18 · Aug 27, 2025 · Updated 9 months ago
- ☆46 · May 8, 2024 · Updated 2 years ago
- Python Package reimplementation of Holistically-Nested Edge Detection in PyTorch ☆12 · Jan 5, 2021 · Updated 5 years ago
- This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos" ☆13 · Aug 22, 2025 · Updated 9 months ago
- Website for MathVista ☆21 · Jun 9, 2025 · Updated last year
- This repository aims to collect the articles and codes for the Visual Storytelling (VIST) task. VIST is a vision-and-language task. It ai… ☆26 · Mar 3, 2021 · Updated 5 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework" ☆19 · Jan 18, 2026 · Updated 4 months ago
- Implementation of Confidence-Calibrated Adversarial Training (CCAT). ☆44 · Aug 3, 2020 · Updated 5 years ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMs ☆14 · Apr 23, 2026 · Updated last month
- CoS: Chain-of-Shot Prompting for Long Video Understanding ☆53 · Feb 13, 2025 · Updated last year
- [ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models ☆159 · Feb 19, 2026 · Updated 3 months ago
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization ☆27 · Jun 14, 2022 · Updated 4 years ago
- Code for the paper "Finetuning CLIP to Reason about Pairwise Differences" ☆21 · Oct 1, 2024 · Updated last year
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models" ☆26 · Apr 13, 2026 · Updated 2 months ago
- [ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion ☆21 · Jul 2, 2024 · Updated last year
- ☆67 · Feb 27, 2026 · Updated 3 months ago
- Provably Robust Boosted Decision Stumps and Trees against Adversarial Attacks [NeurIPS 2019] ☆50 · Apr 25, 2020 · Updated 6 years ago
- Agentic Keyframe Search for Video Question Answering ☆18 · Apr 7, 2025 · Updated last year
- Artistic Vision-Language Understanding with Adapter-enhanced MiniGPT-4 ☆30 · May 31, 2023 · Updated 3 years ago