AVSpeech downloader
☆69Jan 30, 2019Updated 7 years ago
Alternatives and similar repositories for avspeech-downloader
Users that are interested in avspeech-downloader are comparing it to the libraries listed below.
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Mar 29, 2021Updated 5 years ago
- Simple python script for downloading AVSpeech Dataset☆47Mar 16, 2024Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Include some core functions and model to handle speech separation☆156Jun 24, 2021Updated 4 years ago
- Looking to listen at the cocktail party☆36Mar 24, 2023Updated 3 years ago
- ☆10Mar 24, 2023Updated 3 years ago
- CHiME-5 Baseline Array Synchronisation☆12Sep 24, 2018Updated 7 years ago
- ☆16Apr 27, 2025Updated last year
- ☆20Oct 3, 2023Updated 2 years ago
- Audio Source Separation using Neural Networks☆24Apr 27, 2018Updated 8 years ago
- Executable code based on Google articles☆168Dec 8, 2022Updated 3 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆36Mar 22, 2021Updated 5 years ago
- A python implementation of a traditional Dynamic Range Compressor☆14Oct 30, 2020Updated 5 years ago
- Out of time: automated lip sync in the wild☆887Apr 17, 2026Updated 2 months ago
- Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17☆115Feb 15, 2017Updated 9 years ago
- Text Recognition and Detection based on Pixel-Link paper implemented in pytorch☆28May 30, 2023Updated 3 years ago
- converting the pretrained tensorflow SoundNet model to pytorch☆14Jun 15, 2022Updated 4 years ago
- ☆19Apr 1, 2020Updated 6 years ago
- ☆40Jul 19, 2018Updated 7 years ago
- PyTorch implementation of SDNet☆23Jun 1, 2021Updated 5 years ago
- ☆53May 15, 2025Updated last year
- Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments☆112Mar 19, 2024Updated 2 years ago
- Boundaries and Region Representation Fusion☆12Mar 24, 2023Updated 3 years ago
- An unofficial PyTorch implementation of Microsoft's PHASEN☆232Apr 10, 2024Updated 2 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆27Jan 11, 2022Updated 4 years ago
- mirror of VoxCeleb dataset - a large-scale speaker identification dataset☆75Jul 5, 2019Updated 6 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆402Feb 4, 2019Updated 7 years ago
- ☆14Aug 10, 2015Updated 10 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- TTS front-end☆11Dec 19, 2018Updated 7 years ago
- An STFT/iSTFT for PyTorch.☆371Oct 31, 2023Updated 2 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features☆225Jul 17, 2019Updated 6 years ago
- Learn an L3 embedding from audio/video pairs☆89Apr 24, 2022Updated 4 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- ☆10Apr 12, 2016Updated 10 years ago
- Linux Shell programming notes☆15Aug 29, 2017Updated 8 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆349Sep 5, 2020Updated 5 years ago
- SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition☆131Jun 7, 2024Updated 2 years ago