Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding"
☆53Dec 30, 2023Updated 2 years ago
Alternatives and similar repositories for ADPN-MM
Users that are interested in ADPN-MM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of NeurIPS D&B Track 2024 paper "VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understan…☆40Jan 20, 2025Updated last year
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detection☆57Jul 5, 2024Updated last year
- Machine Learning Course From Scratch☆13Jul 24, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unified Audio-Visual Perception for Multi-Task Video Localization☆31Apr 19, 2024Updated last year
- Source code of our MM'22 paper Partially Relevant Video Retrieval☆55Nov 4, 2024Updated last year
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Sep 25, 2024Updated last year
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆35Feb 26, 2025Updated last year
- Chinese CLIP models with SOTA performance.☆60Aug 28, 2023Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆19Mar 3, 2025Updated last year
- Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval的相关工作☆30Mar 4, 2022Updated 4 years ago
- Official implement of MIA-DPO☆72Jan 23, 2025Updated last year
- DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. [ACMMM 2024] Official PyTorch implementation☆39Sep 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Test-time Fourier Style Calibration for Domain Generalization - IJCAI 2022☆16Jul 21, 2022Updated 3 years ago
- ☆11Aug 7, 2024Updated last year
- 阅读顺序、Layoutreader☆19May 8, 2025Updated 10 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆130Aug 23, 2024Updated last year
- [AAAI 2024] UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning☆12Dec 10, 2023Updated 2 years ago
- EIVideo- 交互式智能视频标注工具,几次鼠标点击即可解放双手,让视频标注更加轻松☆32Jul 4, 2022Updated 3 years ago
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆20Mar 13, 2024Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- ☆10Jul 16, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Implementation of robust deep anomaly detection models using Partial Conditional Invariant Regularization (PCIR). Accepted to NeurIPS 202…☆13Oct 27, 2023Updated 2 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆12Jun 11, 2024Updated last year
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 7 months ago
- A Tiny Project For ASR model training and Deployment☆26Oct 14, 2022Updated 3 years ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆45Aug 1, 2024Updated last year
- The official repository for our ICLR2024 paper, DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Gene…☆59Dec 25, 2024Updated last year
- ☆12Jan 10, 2025Updated last year
- ☆17Dec 4, 2024Updated last year
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".☆14Feb 26, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆44Apr 13, 2024Updated last year
- About Codes for ACL 2023 paper: Exploiting! Multimodal Relation Extraction with Feature Denoising and Multimodal Topic Modeling.☆20Jun 25, 2024Updated last year
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- Code and data of "Controllable Unsupervised Event-based Video Generation" (accepted as ICIP oral and invited by WACV workshop)☆19Nov 5, 2024Updated last year
- Official Repository for "Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection" (AAAI …☆14Mar 1, 2025Updated last year
- 🔥🔥First-ever hour scale video understanding models☆618Jul 14, 2025Updated 8 months ago
- Space-Time Interaction Graph Parsing Networks for Human-Object Interaction Recognition,ACM MM'21☆14May 12, 2022Updated 3 years ago