Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
Alternatives and similar repositories for HighlightDetection-CLC
Users that are interested in HighlightDetection-CLC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆31Sep 20, 2021Updated 4 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆16Oct 27, 2024Updated last year
- ☆38Oct 11, 2022Updated 3 years ago
- ☆14Oct 30, 2023Updated 2 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018☆20May 31, 2022Updated 4 years ago
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- Paper "SeqRank: Sequential Ranking of Salient Objects" is accepted in AAAI-24.☆11Jun 12, 2024Updated 2 years ago
- Codes for the AAAI 2023 paper (Oral) "Efficient Mirror Detection via Multi-level Heterogeneous Learning" https://arxiv.org/pdf/2211.1564…☆15Jan 18, 2023Updated 3 years ago
- Paper "Learning-Semantic-Associations-for-Mirror-Detection" is accepted in CVPR 2022☆14Feb 21, 2024Updated 2 years ago
- [Pattern Recognition 2025 🌟]Unbiased Multiscale Modal Fusion Model for Multimodal Semantic Segmentation☆10Jun 12, 2024Updated 2 years ago
- [ACMMM 2024] An Inverse Partial Optimal Transport Framework for Music-guided Movie Trailer Generation☆16Mar 15, 2025Updated last year
- ☆37Apr 7, 2022Updated 4 years ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆28Jun 4, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆17Oct 17, 2024Updated last year
- Jin, Xiao, et al. "FCMNet: Frequency-aware cross-modality attention networks for RGB-D salient object detection." Neurocomputing 491 (202…☆11Apr 11, 2024Updated 2 years ago
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆29Jul 2, 2025Updated last year
- This is an official pytorch implementation of 'Group-wise Inhibition based Feature Regularization for Robust Classification' (ICCV 2021 a…☆10Dec 10, 2022Updated 3 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- [2021 CVPR] Positive Sample Propagation along the Audio-Visual Event Line☆42Jul 5, 2022Updated 3 years ago
- Combating Mode Collapse via Manifold Entropy Estimation☆11Apr 21, 2023Updated 3 years ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆32May 3, 2024Updated 2 years ago
- Pytorch implementation of paper "DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion"☆25Sep 1, 2023Updated 2 years ago
- [2023 ACL] CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding☆31Aug 5, 2023Updated 2 years ago
- Official Implement of ECCV 2024 paper "Multi-modal Crowd Counting via a Broker Modality"☆18Mar 19, 2026Updated 3 months ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- ☆11Oct 4, 2022Updated 3 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆106Aug 11, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆26Jul 16, 2025Updated 11 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆25Mar 8, 2026Updated 3 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆58Feb 1, 2026Updated 5 months ago
- ☆13Jul 10, 2024Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 3 years ago