kookie12 / FlexiEdit
[ECCV 2024] FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
☆74Updated 4 months ago
Alternatives and similar repositories for FlexiEdit
Users who are interested in FlexiEdit are comparing it to the repositories listed below.
- ICML 2024, Official Implementation of "Cross-view Masked Diffusion Transformers for Person Image Synthesis."☆53Updated last year
- [CVPR 2025] ITA-MDT official implementation☆64Updated 2 months ago
- ☆25Updated 9 months ago
- [ECCV'24] Official code for "BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation"☆42Updated last year
- Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…☆48Updated last year
- ☆25Updated 9 months ago
- ☆17Updated last year
- Retrieval_OOD_for_Multimodal_AI☆11Updated last year
- FRAG: Frequency Adaptive Group for Diffusion Video Editing (ICML 2024)☆70Updated 3 months ago
- This repository is the official implementation of the paper: Physics Informed Distillation for Diffusion Models, accepted by Transactions…☆53Updated 3 weeks ago
- [ECCV'22] SQuiDNet: Selective Query-guided Debiasing Network for Video Corpus Moment Retrieval☆73Updated 3 years ago
- [ICLR'23] ESD: Expected Squared Difference as a Tuning-Free Trainable Calibration Measure☆40Updated last year
- [ICLR'25] MDSGen: Fast and Efficient Masked Diffusion Temporal-Aware Transformers for Open-Domain Sound Generation☆38Updated 5 months ago
- Predictive Coding for Decision Transformer (IROS 2024)☆41Updated 6 months ago
- [ICML'25 Spotlight] FlowDrag: 3D-aware Drag-based Image Editing with Mesh-guided Deformation Vector Flow Fields☆45Updated 5 months ago
- [INTERSPEECH'24] Official code for "LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition"☆33Updated 5 months ago
- [ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"☆34Updated 7 months ago
- Policy Learning from Large Vision-Language Model Feedback Without Reward Modeling (IROS 2025)☆36Updated 2 months ago
- Video-based AI dialogue system☆14Updated last year
- Dual-scale Doppler Attention for Human Identification☆47Updated 4 months ago
- Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval☆65Updated 4 years ago
- (ICCV2025) Occlusion-robust Stylization for Drawing-based 3D Animation☆49Updated 4 months ago
- Text-based Video Retrieval☆15Updated last year
- ☆33Updated last year
- Causal Localization Network for Radar Human Localization with micro-Doppler signature☆61Updated last year
- Winning SubNetwork (WSN), Soft-SubNetwork (SoftNet)☆43Updated last year
- DNI: Dilutional Noise Initialization for Diffusion Video Editing (ECCV 2024)☆46Updated last year
- Enhancing Rating-Based Reinforcement Learning to Effectively Leverage Feedback from Large Vision-Language Models (ICML 2025)☆52Updated 4 months ago
- Multimodal_AI_Video_Dialogue☆16Updated last year
- HEAR: Hearing Enhanced Audio Response for Video-grounded Dialogue, EMNLP 2023 (long, findings) [STARLAB] Audio Enhancement for video-dial…☆57Updated last year