kuai-lab/sound-guided-semantic-image-manipulation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kuai-lab/sound-guided-semantic-image-manipulation)

kuai-lab / sound-guided-semantic-image-manipulation

Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)

☆80

Alternatives and similar repositories for sound-guided-semantic-image-manipulation

Users that are interested in sound-guided-semantic-image-manipulation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kuai-lab / soundini-official
View on GitHub
We are committing code.
☆44May 18, 2023Updated 3 years ago
iknoom / Problem_Solving
View on GitHub
나의 알고리즘 문제해결
☆10Sep 12, 2022Updated 3 years ago
MICV-yonsei / EAGLE
View on GitHub
[CVPR 2024 Highlight✨] Official Pytorch Code for EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation
☆93Sep 12, 2024Updated last year
MICV-yonsei / CXRL
View on GitHub
[MICCAI 2024 Spotlight✨] Official Pytorch Code for Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning
☆13Sep 4, 2024Updated last year
all1m-algorithm-study / uospc
View on GitHub
All about University of Seoul Programing Contest.
☆13Dec 4, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
guyyariv / AudioToken
View on GitHub
[InterSpeech 2023] The official PyTorch implementation of: "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Imag…
☆89May 18, 2026Updated 2 months ago
Sejong-Talk-With / ReviewForYou
View on GitHub
한이음 ICT 멘토링 - 자연어처리(NLP)와 머신러닝을 이용한 리뷰 데이터 분석 및 사용자 경험 분석
☆10Feb 24, 2022Updated 4 years ago
Tinglok / avstyle
View on GitHub
Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)
☆15Jan 26, 2023Updated 3 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
facebookresearch / SemanticImageTranslation
View on GitHub
Evaluation benchmark for the task of Semantic Image Translation. Contains code to run FlexIT (CVPR 2022)
☆35Mar 25, 2022Updated 4 years ago
kaist-ami / Sound2Scene
View on GitHub
☆42Apr 14, 2025Updated last year
SunnerLi / Cross-you-in-style
View on GitHub
Official implementation of the ACM MM 2020 paper
☆16Apr 27, 2021Updated 5 years ago
AIHub-1 / AIHub-Brain
View on GitHub
AI Innovation Hub BCI Project
☆33Dec 12, 2023Updated 2 years ago
krantiparida / awesome-audio-visual
View on GitHub
A curated list of different papers and datasets in various areas of audio-visual processing
☆775Jan 30, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
BurakCanBiner / SonicDiffusion
View on GitHub
☆43Nov 8, 2024Updated last year
boostcampaitech3 / final-project-level3-cv-17
View on GitHub
[2022.05.16 ~ 2022.06.10] 🌤️미세먼지 없는 맑은 사진📷 - 부스트캠프 AI Tech 3기 최종 프로젝트
☆14Jun 11, 2022Updated 4 years ago
Takaaki-Saeki / ssl_speech_restoration_v2
View on GitHub
☆17Dec 18, 2023Updated 2 years ago
sejong-rcv / URP
View on GitHub
2026년 하계 학부연구생 사전 신청서 및 주의사항 (2026.07.01-2026.08.31)
☆10Mar 10, 2026Updated 4 months ago
daniel03c1 / NAS_VAD
View on GitHub
☆26Oct 25, 2024Updated last year
jasongief / LEAP
View on GitHub
[2024 ECCV] Label-anticipated Event Disentanglement for Audio-Visual Video Parsing
☆14Nov 17, 2024Updated last year
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
IFICL / SLfM
View on GitHub
Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
☆43Updated this week
appletea233 / LLaVA-ST
View on GitHub
[CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
☆84Jul 4, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jinbae-s / ACVIS
View on GitHub
[ICASSP 2026] The official pytorch implementation of ACVIS
☆15Jan 19, 2026Updated 6 months ago
Pseudo-Lab / Pseudo3DV
View on GitHub
☆13Mar 29, 2025Updated last year
PardoAlejo / LearningToCut
View on GitHub
Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies
☆51Nov 9, 2022Updated 3 years ago
KID-7391 / seeking-the-shape-of-sound
View on GitHub
☆19Jun 8, 2021Updated 5 years ago
WenFuLee / CS-766-Computer-Vision
View on GitHub
☆15May 8, 2018Updated 8 years ago
baolp / demoireing_with_focused_and_defocused_images_pairs
View on GitHub
☆25Jan 14, 2021Updated 5 years ago
stoneMo / EZ-VSL
View on GitHub
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆41Oct 2, 2022Updated 3 years ago
yj-yu / CiSIN
View on GitHub
Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Jan 17, 2021Updated 5 years ago
guyii54 / Real-Rainy-Image-Datasets
View on GitHub
A summary for existing real rain images datasets
☆11Mar 24, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shlee-lab / KUThesis2022
View on GitHub
This repository is for Korea University Thesis / Dissertation LaTex Template
☆18Jan 2, 2024Updated 2 years ago
yangdongchao / Text-to-sound-Synthesis
View on GitHub
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
☆366Aug 3, 2023Updated 2 years ago
choijeongsoo / av2av
View on GitHub
[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
☆48Sep 6, 2024Updated last year
datamarket-tobigs / Cross-Cutting
View on GitHub
[제 10회 투빅스 컨퍼런스] AI 아이돌 교차편집
☆44Nov 29, 2021Updated 4 years ago
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
alvinliu0 / Visual-Sound-Localization-in-the-Wild
View on GitHub
Code for Visual Sound Localization in the Wild by Cross-Modal Interference Erasing (AAAI 2022).
☆29Feb 15, 2022Updated 4 years ago
LoieSun / Auto-ACD
View on GitHub
code for A Large-scale Dataset for Audio-Language Representation Learning
☆14Sep 18, 2024Updated last year