kuai-lab/soundini-official

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kuai-lab/soundini-official)

kuai-lab / soundini-official

We are committing code.

☆44

Alternatives and similar repositories for soundini-official

Users that are interested in soundini-official are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kuai-lab / sound-guided-semantic-image-manipulation
View on GitHub
Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)
☆80Aug 14, 2023Updated 2 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
Tinglok / avstyle
View on GitHub
Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)
☆15Jan 26, 2023Updated 3 years ago
stoneMo / EZ-VSL
View on GitHub
Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)
☆41Oct 2, 2022Updated 3 years ago
sejong-rcv / URP
View on GitHub
2026년 하계 학부연구생 사전 신청서 및 주의사항 (2026.07.01-2026.08.31)
☆10Mar 10, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Minglu58 / TA2V
View on GitHub
☆15Dec 1, 2025Updated 7 months ago
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆41Mar 24, 2023Updated 3 years ago
kaist-ami / SoundBrush
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
shlee-lab / KUThesis2022
View on GitHub
This repository is for Korea University Thesis / Dissertation LaTex Template
☆18Jan 2, 2024Updated 2 years ago
SunnerLi / Cross-you-in-style
View on GitHub
Official implementation of the ACM MM 2020 paper
☆16Apr 27, 2021Updated 5 years ago
guyyariv / AudioToken
View on GitHub
[InterSpeech 2023] The official PyTorch implementation of: "AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Imag…
☆89May 18, 2026Updated 2 months ago
salesforce / GlueGen
View on GitHub
☆65Jun 16, 2025Updated last year
LimHyungTae / Naverlabs-LiDAR-API
View on GitHub
LiDAR API of NAVERLABS indoor dataset
☆26Nov 19, 2021Updated 4 years ago
osamhack2020 / IoT_COVID19-Detector_CO-vision
View on GitHub
Identify and Alert people at risk of COVID-19 infection in REAL-TIME using raspberry-pi.
☆11Aug 5, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
HS-YN / PanoAVQA
View on GitHub
Official repository of PanoAVQA: Grounded Audio-Visual Question Answering in 360° Videos (ICCV 2021)
☆16Oct 12, 2021Updated 4 years ago
yiming-j / SPLINE-Net
View on GitHub
SPLINE-Net: Sparse Photometric Stereo through Lighting Interpolation and Normal Estimation Networks
☆11Apr 13, 2023Updated 3 years ago
WikiChao / VisAH
View on GitHub
[CVPR 2025] Pytorch implementation of the paper "Learning to Highlight Audio by Watching Movies"
☆15Oct 1, 2025Updated 9 months ago
rxtan2 / AVSeT
View on GitHub
☆17Oct 2, 2023Updated 2 years ago
hxixixh / mix-and-localize
View on GitHub
☆23Mar 20, 2024Updated 2 years ago
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
iknoom / Problem_Solving
View on GitHub
나의 알고리즘 문제해결
☆10Sep 12, 2022Updated 3 years ago
all1m-algorithm-study / uospc
View on GitHub
All about University of Seoul Programing Contest.
☆13Dec 4, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
AgentCooper2002 / EDMSound
View on GitHub
Codebase and project page for EDMSound
☆35Nov 20, 2023Updated 2 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
kaist-ami / Sound2Scene
View on GitHub
☆42Apr 14, 2025Updated last year
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 4 years ago
mingen-pan / Reinforcement-Learning-Q-learning-8puzzle-Pytorch
View on GitHub
This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)
☆12Mar 24, 2018Updated 8 years ago
junwenxiong / diff_sal
View on GitHub
Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
☆29May 26, 2024Updated 2 years ago
WikiChao / Ego-AV-Loc
View on GitHub
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27Jan 6, 2024Updated 2 years ago
ysoh20 / Prometheus-Team1
View on GitHub
☆25Aug 1, 2025Updated 11 months ago
yzxing87 / Seeing-and-Hearing
View on GitHub
[CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
☆155Jul 6, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
brincolab / High-Order-interactions
View on GitHub
High-Order interactions
☆12Jul 5, 2024Updated 2 years ago
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
JishengBai / ICME2024ASC
View on GitHub
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18Mar 16, 2024Updated 2 years ago
CandleLabAI / TPFNet
View on GitHub
☆11Dec 26, 2022Updated 3 years ago
showlab / VisorGPT
View on GitHub
[NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT
☆138May 4, 2024Updated 2 years ago
daeunni / BECoTTA
View on GitHub
Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".
☆51Jun 16, 2024Updated 2 years ago
ZYH-Lightyear / LVAS
View on GitHub
LVAS-Agent Code Base
☆21Apr 15, 2025Updated last year