Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding"
☆51Dec 30, 2023Updated 2 years ago
Alternatives and similar repositories for ADPN-MM
Users that are interested in ADPN-MM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆40Jan 20, 2025Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated 2 years ago
- [2021 MultiMedia] CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval☆43Sep 23, 2021Updated 4 years ago
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆56Nov 4, 2024Updated last year
- ☆11Oct 29, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [Tensorflow] A Game Theoretic approach using GAN for Phishing URL synthesis and detection☆11Nov 14, 2022Updated 3 years ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆28Sep 25, 2024Updated last year
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆36Feb 26, 2025Updated last year
- Chinese CLIP models with SOTA performance.☆62Aug 28, 2023Updated 2 years ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆20Mar 3, 2025Updated last year
- Official implement of MIA-DPO☆69Jan 23, 2025Updated last year
- ☆15Dec 11, 2024Updated last year
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Feb 9, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Trying to implement https://arxiv.org/abs/2305.08891☆34Jun 10, 2023Updated 3 years ago
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models☆147Aug 21, 2025Updated 9 months ago
- [CVPR 2025, Highlight] The official implementation of the paper "Unleashing In-context Learning of Autoregressive Models for Few-shot Ima…☆28Jun 6, 2025Updated last year
- ☆12Aug 7, 2024Updated last year
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding☆128Dec 10, 2024Updated last year
- Precision Search through Multi-Style Inputs☆73Jul 30, 2025Updated 10 months ago
- 阅读顺序、Layoutreader☆18May 8, 2025Updated last year
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆130Aug 23, 2024Updated last year
- ☆16Feb 26, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆19Mar 13, 2024Updated 2 years ago
- ☆10Jul 16, 2025Updated 11 months ago
- Phishing websites are fraudulent sites that impersonate a trusted party to gain access to sensitive information of an individual person o…☆15May 1, 2020Updated 6 years ago
- A Tiny Project For ASR model training and Deployment☆26Oct 14, 2022Updated 3 years ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆46Aug 1, 2024Updated last year
- 前沿论文持续更新--视频时刻定位 or 时域语言定位 or 视频片段检索。☆264Aug 26, 2023Updated 2 years ago
- Mental stress has become a standard part of day-to-day life. However, experiencing long-term and high-level stress affects the daily life…☆15Dec 8, 2022Updated 3 years ago
- Code for HAR-GCNN: Deep Graph CNNs for Human Activity Recognition From Highly Unlabeled Mobile Sensor Data, IEEE PerCom CoMoRea 2022☆13May 9, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆44Apr 13, 2024Updated 2 years ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)☆860Jul 29, 2024Updated last year
- [NeurIPS'24] I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing☆33Dec 9, 2025Updated 6 months ago
- ☆260Dec 10, 2022Updated 3 years ago
- Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)☆19Nov 5, 2024Updated last year
- ☆17Apr 21, 2026Updated last month