kaist-ami/SoundBrush

☆14

Alternatives and similar repositories for SoundBrush

Users who are interested in SoundBrush are comparing it to the repositories listed below.

kaist-ami / SMILE-Dataset
[NAACL'24] Repository for "SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models"
☆15 · Jun 18, 2024 · Updated 2 years ago
kaist-ami / AVHBench
[ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"
☆25 · Mar 8, 2026 · Updated 4 months ago
kaist-ami / Sound2Scene
☆42 · Apr 14, 2025 · Updated last year
kaist-ami / Hand-Uncertainty
[BMVC'25] Official repository for "Learning Correlation-aware Aleatoric Uncertainty for 3D Hand Pose Estimation"
☆23 · Dec 8, 2025 · Updated 7 months ago
kaist-ami / Uni-DVPS
[RA-L'24, IROS'24] Official PyTorch Implementation of "Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation"
☆13 · Oct 11, 2024 · Updated last year
Web-Engine / pubg-minimap-replay
The component to replay mini-map of the game "Player Unknown Battle Ground"
☆16 · Jan 4, 2023 · Updated 3 years ago
tail95 / Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆10 · Aug 1, 2019 · Updated 6 years ago
RanaCM / DSU-AVO
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12 · May 13, 2024 · Updated 2 years ago
kaist-ami / voicecraft-dub
[ICCV'25] Official PyTorch Implementation of "VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models"
☆17 · Dec 8, 2025 · Updated 7 months ago
BurakCanBiner / SonicDiffusion
☆43 · Nov 8, 2024 · Updated last year
kaist-ami / Deep-Motion-Mag-Pytorch
[ECCV'18] Author-verified Pytorch Reimplementation of "Learning-based Video Motion Magnification"
☆16 · Dec 23, 2024 · Updated last year
kaist-ami / FPRF
[AAAI'24] Official PyTorch implementation of the paper "FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radianc…
☆18 · Nov 29, 2024 · Updated last year
AgentCooper2002 / EDMSound
Codebase and project page for EDMSound
☆35 · Nov 20, 2023 · Updated 2 years ago
kaist-ami / TextManiA
[ICCV’23] Official repository for "TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation"
☆22 · Nov 1, 2023 · Updated 2 years ago
luo-ziyuan / CopyRNeRF-code
☆33 · Nov 4, 2023 · Updated 2 years ago
ku-vai / TPoS
This repository is for The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV 2023)
☆25 · Dec 7, 2023 · Updated 2 years ago
heng-hw / V2A-Mapper
[AAAI 2024] V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models
☆29 · Dec 14, 2023 · Updated 2 years ago
WikiChao / FreSca
[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model
☆55 · May 31, 2025 · Updated last year
skakouros / s3prl_attentive_correlation
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13 · Nov 18, 2022 · Updated 3 years ago
kaist-ami / Automated-Model-Discovery
[NeurIPS'25] Automated Model Discovery via Multi-modal & Multi-step Pipeline
☆22 · Dec 10, 2025 · Updated 7 months ago
kaist-ami / JointDiT
[ICCV'25] Official PyTorch Implementation of "JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers"
☆31 · Nov 27, 2025 · Updated 7 months ago
kuai-lab / soundini-official
We are committing code.
☆44 · May 18, 2023 · Updated 3 years ago
yuhanghe01 / Sound3DVDet
Code for WACV24 work for multiview acoustic-visual detection
☆13 · Mar 22, 2024 · Updated 2 years ago
kaist-ami / Axial-mm
[ECCV'24] Official PyTorch Implementation of "Learning-based Axial Video Motion Magnification"
☆28 · Dec 22, 2024 · Updated last year
JiuFengSC / ElasticAST
Official code of ElasticAST (Interspeech 2024 paper)
☆34 · Jul 30, 2024 · Updated last year
psky1111 / Tencent-TSSR
Official implementation of TSSR
☆16 · Mar 5, 2026 · Updated 4 months ago
ilpoviertola / V-AURA
The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)
☆35 · Feb 11, 2026 · Updated 5 months ago
gqq1210 / AS-UNet
☆11 · Apr 4, 2021 · Updated 5 years ago
cvlab-columbia / paperbot
PaperBot: Learning to Design Real-World Tools Using Paper
☆13 · Mar 15, 2024 · Updated 2 years ago
kaistmm / SSLalignment
☆37 · May 28, 2025 · Updated last year
zc2023 / TokenHPE
(CVPR 2023) TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers
☆14 · Oct 29, 2023 · Updated 2 years ago
MengboLi / MS-SENet
☆11 · Jul 16, 2024 · Updated 2 years ago
Emanuele97x / DreamCache
DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching (CVPR'25)
☆20 · Jun 3, 2025 · Updated last year
gudgud96 / noisy-student-emotion-training
Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging
☆11 · Dec 2, 2021 · Updated 4 years ago
cdb342 / ALOcc
[ICCV 2025] ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Prediction
☆52 · Dec 1, 2025 · Updated 7 months ago
MCR-PEFT / C-MCR
☆44 · Updated this week
DTaoo / DMC
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15 · May 27, 2020 · Updated 6 years ago
cvlab-columbia / trajectories
Code for the paper "Representing Spatial Trajectories as Distributions"
☆13 · Jan 17, 2023 · Updated 3 years ago
KranthiKumarR / Localize-to-Binauralize
Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)
☆10 · Oct 11, 2021 · Updated 4 years ago