taegyeong-lee / Generating-Realistic-Images-from-In-the-wild-Sounds
Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023
☆11Updated last week
Alternatives and similar repositories for Generating-Realistic-Images-from-In-the-wild-Sounds:
Users that are interested in Generating-Realistic-Images-from-In-the-wild-Sounds are comparing it to the libraries listed below
- official implementation of 'STREAM : Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models'☆26Updated 8 months ago
- Official repository for LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data (CVPR 2023)☆143Updated last year
- NeurIPS 2023 - TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models Official Code☆103Updated 8 months ago
- Can We Find Strong Lottery Tickets in Generative Models? - Official Code (Pytorch)☆99Updated 8 months ago
- Official Pytorch Implementation for "Fix the Noise: Disentangling Source Feature for Controllable Domain Translation" (CVPR 2023, CVPRW 2…☆179Updated last year
- BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation☆97Updated last year
- Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations☆110Updated last year
- code of RMFER: Semi-supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video☆90Updated last year
- Carousel Memory: Rethinking the Design of Episodic Memory for Continual Learning☆83Updated 2 years ago
- ☆106Updated last year
- ☆94Updated 2 years ago
- [NeurIPS 2023] Softmax Output Approximation for Activation Memory-Efficient Training of Attention-based Networks☆82Updated 9 months ago
- Domain generalization method code based on DomainBed☆101Updated 2 years ago
- ☆33Updated 5 months ago
- (WACV'24) MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning☆85Updated last year
- A repository of a paper named "Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?", accepted to ICRA 2023.☆107Updated last year
- ☆85Updated 2 years ago
- ☆79Updated 2 years ago
- ☆81Updated 3 months ago
- [SenSys 2023] On-NAS: On-Device Neural Architecture Search on Memory-Constrained Intelligent Embedded Systems☆90Updated last year
- Zico (ATC'21) source code (based on TensorFlow 1.13)☆73Updated last year
- [ECCV’24] Official Implementation for CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenario…☆52Updated 7 months ago
- ☆26Updated 2 weeks ago
- CVPR 24 paper: Dysen-VDM: Empowering Dynamics-aware Text-to-Video Diffusion with LLMs☆11Updated last year
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆140Updated 8 months ago
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following☆30Updated 2 months ago
- This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".☆78Updated 2 months ago
- ☆71Updated 3 years ago
- Official code for EasyDrag (CVPR 2024)☆14Updated 9 months ago
- formulate diet optimization as sequence generation that produces a diet of recommended intake☆75Updated 3 years ago