hangzhaomit / Sound-of-Pixels
Codebase for ECCV18 "The Sound of Pixels"
☆371 Updated 2 years ago
Related projects
Alternatives and complementary repositories for Sound-of-Pixels
- MUSIC Dataset from The Sound of Pixels (ECCV '18) ☆118 Updated 2 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features ☆220 Updated 5 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019) ☆93 Updated last year
- 2.5D visual sound ☆110 Updated last year
- ☆223 Updated 4 years ago
- VGGSound: A Large-scale Audio-Visual Dataset ☆291 Updated 3 years ago
- 2.5D visual sound dataset ☆92 Updated 3 years ago
- Learning to Separate Object Sounds by Watching Unlabeled Video (ECCV 2018) ☆49 Updated 5 years ago
- Unofficial Implementation of Google DeepMind's paper `Objects that Sound` ☆83 Updated 6 years ago
- Implementation of the Wave-U-Net for audio source separation ☆844 Updated last year
- PyTorch port of Google Research's VGGish model used for extracting audio features. ☆377 Updated 3 years ago
- A library for soundscape synthesis and augmentation ☆379 Updated 2 years ago
- Audio Source Separation Without Any Training Data. ☆157 Updated 7 months ago
- TensorFlow implementation of "SoundNet". ☆145 Updated 6 years ago
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM ☆354 Updated last year
- DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes. ☆349 Updated 4 years ago
- Deep Convolutional Neural Networks for Musical Source Separation ☆471 Updated 4 years ago
- A neural network for end-to-end music source separation ☆225 Updated 4 years ago
- Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video" ☆111 Updated 3 years ago
- Audio-Visual Speech Separation with Cross-Modal Consistency ☆221 Updated last year
- Codebase and Dataset for the paper: Learning to Localize Sound Source in Visual Scenes ☆84 Updated 3 years ago
- Spatial Audio Generation ☆100 Updated last year
- Includes some core functions and models to handle speech separation ☆154 Updated 3 years ago
- Audio-Visual Event Localization in Unconstrained Videos, ECCV 2018 ☆171 Updated 3 years ago
- An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆348 Updated 3 years ago
- An open source dataset for source separation ☆378 Updated 9 months ago
- Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional re… ☆338 Updated last year
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test. ☆83 Updated 5 years ago
- Python parser and tools for MUSDB18 Music Separation Dataset ☆161 Updated 11 months ago