Multimodal_AI_Video_Dialogue
☆16Dec 3, 2024Updated last year
Alternatives and similar repositories for Multimodal_AI_Video_Dialogue
Users that are interested in Multimodal_AI_Video_Dialogue are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 비디오 기반 인공지능 대화시스템☆11Aug 16, 2023Updated 2 years ago
- Text-based Video Retrieval☆15Dec 4, 2024Updated last year
- Retrieval_OOD_for_Multimodal_AI☆11Dec 4, 2024Updated last year
- Dual-scale Doppler Attention for Human Identification☆47Aug 13, 2025Updated 10 months ago
- SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval (ICCV'2023), [STARLAB] This repositery is a system to…☆57Apr 14, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆70Aug 23, 2025Updated 9 months ago
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆65Dec 13, 2021Updated 4 years ago
- Test-time Procrustes Calibration for Diffusion-based Human Image Animation, NeurIPS 2024☆52Aug 23, 2025Updated 9 months ago
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial…☆57Dec 23, 2023Updated 2 years ago
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆53Nov 5, 2024Updated last year
- ☆39Dec 21, 2024Updated last year
- Official code for "SimPSI: A Simple Strategy to Preserve Spectral Information in Time Series Data Augmentation", AAAI 2024.☆36Jan 22, 2025Updated last year
- [ICCV'25] TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis☆35Sep 22, 2025Updated 8 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆43Nov 19, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ECCV 2024] FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing☆75Aug 13, 2025Updated 10 months ago
- ☆18Nov 19, 2024Updated last year
- CCFDM reinforcement learning☆40Dec 28, 2021Updated 4 years ago
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆37Nov 25, 2022Updated 3 years ago
- DimCL: Dimensional Contrastive Learning☆30Dec 9, 2025Updated 6 months ago
- [CVPR 2025] ITA-MDT official implementation☆67Dec 21, 2025Updated 5 months ago
- [ICML'25 Spotlight] FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields☆45Dec 28, 2025Updated 5 months ago
- ☆33Nov 26, 2024Updated last year
- ☆39Dec 14, 2021Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICLR'25] MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation☆39Dec 25, 2025Updated 5 months ago
- ☆28Mar 13, 2025Updated last year
- 비디오 기반 인공지능 대화시스템☆14Dec 23, 2023Updated 2 years ago
- DNI: Dilutional Noise Initialization for Diffusion Video Editing (ECCV 2024)☆46Jul 17, 2024Updated last year
- PyTorch implementation of **Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses**☆31Dec 9, 2025Updated 6 months ago
- Winning SubNetwork (WSN)☆59Jan 17, 2024Updated 2 years ago
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆49Nov 19, 2024Updated last year
- [ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"☆35Dec 26, 2025Updated 5 months ago
- MSIT AI Fair(MAF)☆39May 12, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆38Jan 8, 2026Updated 5 months ago
- This is official repository for Dual Temperature Helps Contrastive Learning without Many Negative Samples (CVPR2022)☆27Dec 1, 2022Updated 3 years ago
- AI Development in Evolving Policy [AI DEP]☆46Jul 7, 2025Updated 11 months ago
- Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization (IROS 2024)☆44Jun 19, 2025Updated 11 months ago
- ☆28Mar 13, 2025Updated last year
- Predictive Coding for Decision Transformer (IROS 2024)☆41Jun 19, 2025Updated 11 months ago
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)☆11Sep 27, 2022Updated 3 years ago