changil/avspeech-downloader

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/changil/avspeech-downloader)

changil / avspeech-downloader

AVSpeech downloader

☆69

Alternatives and similar repositories for avspeech-downloader

Users that are interested in avspeech-downloader are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maxhollmann / voxceleb-luigi
View on GitHub
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
☆43Mar 29, 2021Updated 5 years ago
naba89 / AVSpeechDownloader
View on GitHub
Simple python script for downloading AVSpeech Dataset
☆47Mar 16, 2024Updated 2 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
bill9800 / speech_separation
View on GitHub
Include some core functions and model to handle speech separation
☆156Jun 24, 2021Updated 5 years ago
mayurnewase / looking-to-listen-at-cocktail-party
View on GitHub
Looking to listen at cocktail party
☆36Mar 24, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chimechallenge / chime5-synchronisation
View on GitHub
CHiME-5 Baseline Array Synchronisation
☆12Sep 24, 2018Updated 7 years ago
BAI-Yeqi / SF2F_PyTorch
View on GitHub
☆16Apr 27, 2025Updated last year
JusperLee / Looking-to-Listen-at-the-Cocktail-Party
View on GitHub
Executable code based on Google articles
☆166Dec 8, 2022Updated 3 years ago
josephch405 / jit-masker
View on GitHub
☆20Oct 3, 2023Updated 2 years ago
djmoffat / pyCompressor
View on GitHub
A python implementation of a traditional Dynamic Range Compressor
☆14Oct 30, 2020Updated 5 years ago
mayank-git-hub / Text-Recognition
View on GitHub
Text Recognition and Detection based on Pixel-Link paper implemented in pytorch
☆28May 30, 2023Updated 3 years ago
arielephrat / vid2speech
View on GitHub
Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17
☆115Feb 15, 2017Updated 9 years ago
joonson / syncnet_python
View on GitHub
Out of time: automated lip sync in the wild
☆895Apr 17, 2026Updated 3 months ago
smallflyingpig / SoundNet_Pytorch
View on GitHub
converting the pretrained tensorflow SoundNet model to pytorch
☆14Jun 15, 2022Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
crystal-method / Looking-to-Listen
View on GitHub
☆40Jul 19, 2018Updated 8 years ago
aispeech-lab / SDNet
View on GitHub
Pytorch implemention of SDNet
☆23Jun 1, 2021Updated 5 years ago
oranshayer / BRRF
View on GitHub
Boundaries and Region Representation Fusion
☆12Mar 24, 2023Updated 3 years ago
cyrta / voxceleb
View on GitHub
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
☆77Jul 5, 2019Updated 7 years ago
aispeech-lab / WASE
View on GitHub
PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…
☆27Jan 11, 2022Updated 4 years ago
donchev7 / MatlabCode
View on GitHub
☆14Aug 10, 2015Updated 10 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
ronggong / phoneticSimilarity
View on GitHub
phonetic similarity algorithms
☆13Jun 19, 2018Updated 8 years ago
MlWoo / sentence2pinyin
View on GitHub
tts fronted-end
☆11Dec 19, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
a-nagrani / VGGVox
View on GitHub
VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets
☆401Feb 4, 2019Updated 7 years ago
andrewowens / multisensory
View on GitHub
Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
☆225Jul 17, 2019Updated 7 years ago
AjayNandoriya / EPRESERVE
View on GitHub
Implementation for Face Illumination Transfer through Edge-preserving Filters CVPR11
☆13Dec 21, 2017Updated 8 years ago
fgnt / ci_sdr
View on GitHub
☆53May 15, 2025Updated last year
wangzhengyi / EventBusAnalysis
View on GitHub
☆10Apr 12, 2016Updated 10 years ago
WangYihang / LinuxShellScript
View on GitHub
LinuxShell编程笔记
☆15Aug 29, 2017Updated 8 years ago
onolab-tmu / code_2020ICASSP_iss
View on GitHub
Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.
☆22Apr 14, 2020Updated 6 years ago
YoavRamon / Speech-Recognition-Israel
View on GitHub
The repository for Speech Recognition Israel meetup group. It is used to material collection and sharing.
☆13Jul 12, 2020Updated 6 years ago
pseeth / torch-stft
View on GitHub
An STFT/iSTFT for PyTorch.
☆372Oct 31, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Audio-WestlakeU / Narrowband_DeepFiltering
View on GitHub
☆19Apr 1, 2020Updated 6 years ago
santi-pdp / pase
View on GitHub
Problem Agnostic Speech Encoder
☆446Jul 6, 2023Updated 3 years ago
Sytronik / deep-griffinlim-iteration
View on GitHub
PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)
☆39Oct 12, 2019Updated 6 years ago
haoxiangsnr / A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement
View on GitHub
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…
☆350Sep 5, 2020Updated 5 years ago
aispeech-lab / LiMuSE
View on GitHub
PyTorch implementation of LiMuSE
☆33Oct 11, 2022Updated 3 years ago
nay0648 / bssaec2020
View on GitHub
A New Perspective of Auxiliary-Function-Based Independent Component Analysis in Acoustic Echo Cancellation
☆50Jan 13, 2021Updated 5 years ago
LvHang / pitch
View on GitHub
a standalone pitch extractor
☆13Oct 19, 2017Updated 8 years ago