dbstjswo505 / Retrieval_OOD_for_Multimodal_AI
Retrieval_OOD_for_Multimodal_AI
☆11Updated 9 months ago
Alternatives and similar repositories for Retrieval_OOD_for_Multimodal_AI
Users who are interested in Retrieval_OOD_for_Multimodal_AI are comparing it to the repositories listed below.
- Text-based Video Retrieval☆14Updated 9 months ago
- Multimodal_AI_Video_Dialogue☆16Updated 9 months ago
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial…☆58Updated last year
- ☆33Updated 9 months ago
- ☆11Updated 2 years ago
- PyTorch implementation of **Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses**☆31Updated last month
- SCANet: Scene Complexity Aware Network for Weakly-Supervised Video Moment Retrieval (ICCV'2023), [STARLAB] This repository is a system to…☆57Updated 4 months ago
- Test-time Procrustes Calibration for Diffusion-based Human Image Animation, NeurIPS 2024☆53Updated last week
- DNI: Dilutional Noise Initialization for Diffusion Video Editing (ECCV 2024)☆45Updated last year
- [ICCV'25] TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis☆31Updated 2 months ago
- [ICML'25 Spotlight] FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields☆45Updated last month
- Video-based AI dialogue system☆11Updated 2 years ago
- Dual-scale Doppler Attention for Human Identification☆48Updated 3 weeks ago
- Causal Localization Network for Radar Human Localization with micro-Doppler signature☆61Updated 11 months ago
- DCL: Dimensional Contrastive Learning☆30Updated last month
- (ICCV2025) Occlusion-robust Stylization for Drawing-based 3D Animation☆49Updated 2 weeks ago
- Implementation of Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning☆30Updated last month
- ☆39Updated 3 years ago
- ☆39Updated 8 months ago
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆70Updated last week
- [IEEE Access 2022] AI for detecting BPPV disorders specified by beatings, torsional movements of the eyes☆37Updated 2 years ago
- [ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization"☆17Updated this week
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆41Updated last year
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆66Updated 3 years ago
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆44Updated last year
- Video-based AI dialogue system☆14Updated last year
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆42Updated 9 months ago
- [INTERSPEECH'24] Official code for "LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition"☆32Updated last month
- Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models (ICML 2025)☆49Updated last month
- Policy Learning from Large Vision-Language Model Feedback Without Reward Modeling (IROS 2025)☆34Updated last month