msu-video-group / ECCVW24_Saliency_Prediction
ECCV-AIM 2024 Challenge on Video Saliency Prediction
β24Updated 4 months ago
Alternatives and similar repositories for ECCVW24_Saliency_Prediction:
Users that are interested in ECCVW24_Saliency_Prediction are comparing it to the libraries listed below
- This repository contains the code for the NeurIPS paper titled "RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentaβ¦β24Updated 2 months ago
- π See How Top MLLMs Understand Video Compositions.β18Updated 2 months ago
- Official Implementation for "Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing"β52Updated 5 months ago
- Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Predictionβ21Updated 8 months ago
- β14Updated 2 months ago
- π [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. π₯ Winner solution for Video Quality Assessment Challenge at the 1st AISβ¦β50Updated 7 months ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Videoβ19Updated 6 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"β18Updated 3 months ago
- Official released code for VQAΒ² series modelsβ30Updated 3 weeks ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"β45Updated 4 months ago
- This is the GitHub repository for Data Augmentation for Saliency Prediction via Latent Diffusion paper in ECCV 2024, Milano, Italyβ11Updated 3 months ago
- AAAI-2024β20Updated 10 months ago
- Benchmark for generative image modelsβ74Updated last year
- Code for the paper "IQA-Adapter: Exploring Knowledge Transfer from Image Quality Assessment to Diffusion-based Generative Models"β12Updated last month
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficientβ75Updated 3 weeks ago
- [Preprint] Number it: Temporal Grounding Videos like Flipping Mangaβ55Updated 2 months ago
- β£[ECCV 2024 Oral, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a benchβ¦β74Updated 4 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"β64Updated 4 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"β24Updated 4 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspectiveβ59Updated 3 months ago
- Official Implementation of "Magnet: We Never Know How Text-to-Image Diffusion Models Work, Until We Learn How Vision-Language Models Funcβ¦β19Updated 2 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignmentβ41Updated last month
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Followingβ28Updated 3 weeks ago
- [NeurIPS'2022] "Video compression dataset and benchmark of learning-based video-quality metrics", A. Antsiferova, S. Lavrushkin, M. Smirβ¦β22Updated 9 months ago
- Official code for WACV 2024 paper, "Annotation-free Audio-Visual Segmentation"β28Updated 4 months ago
- FQGAN: Factorized Visual Tokenization and Generationβ42Updated last month
- Official Implementation of VideoDPOβ49Updated last month
- The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentationβ17Updated 2 months ago
- [NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".β32Updated last month