Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
Alternatives and similar repositories for HighlightDetection-CLC
Users that are interested in HighlightDetection-CLC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆31Sep 20, 2021Updated 4 years ago
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- ☆38Oct 11, 2022Updated 3 years ago
- ☆14Oct 30, 2023Updated 2 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …☆237Apr 15, 2024Updated last year
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018☆20May 31, 2022Updated 3 years ago
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- Allows AI Agents to sleep for a specified amount of milliseconds, like when they should wait for an API to complete☆19Feb 28, 2025Updated last year
- [CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging☆50Sep 30, 2023Updated 2 years ago
- Paper "SeqRank: Sequential Ranking of Salient Objects" is accepted in AAAI-24.☆11Jun 12, 2024Updated last year
- Codes for the AAAI 2023 paper (Oral) "Efficient Mirror Detection via Multi-level Heterogeneous Learning" https://arxiv.org/pdf/2211.1564…☆13Jan 18, 2023Updated 3 years ago
- Paper "Learning-Semantic-Associations-for-Mirror-Detection" is accepted in CVPR 2022☆13Feb 21, 2024Updated 2 years ago
- ☆37Apr 7, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆25Aug 8, 2025Updated 7 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- [ECCV2024]The official implementation of the DiffPNG paper in PyTorch.☆17Oct 17, 2024Updated last year
- 📦 A lightweight machine learning toolkit for researchers, providing common model design & learning functionalities.☆28Jul 2, 2025Updated 8 months ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆106Feb 14, 2023Updated 3 years ago
- This is an official pytorch implementation of 'Group-wise Inhibition based Feature Regularization for Robust Classification' (ICCV 2021 a…☆10Dec 10, 2022Updated 3 years ago
- Combating Mode Collapse via Manifold Entropy Estimation☆11Apr 21, 2023Updated 2 years ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆20Mar 8, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Pytorch implementation of paper "DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion"☆24Sep 1, 2023Updated 2 years ago
- [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want☆14Jan 5, 2025Updated last year
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- Vision Transformers are Parameter-Efficient Audio-Visual Learners☆107Aug 11, 2023Updated 2 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- Stanford's Bunny written in OpenGL☆11Dec 25, 2018Updated 7 years ago
- A Novel Micro-Expression Recognition Approach Using Attention-Based Magnification-Adaptive Networks, in ICASSP, 2022☆14Nov 22, 2024Updated last year
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated 9 months ago
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆53Feb 1, 2026Updated last month
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- ☆13Jul 10, 2024Updated last year
- Research of DeepSeek Engram Architecture based on Qwen-3 and Stable Diffusion series.☆45Feb 14, 2026Updated last month
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 5 months ago
- [ICML2023] Long-Term Rhythmic Video Soundtracker☆62Jul 28, 2025Updated 7 months ago