JusperLee/Looking-to-Listen-at-the-Cocktail-Party

Executable code based on Google articles

☆166

Alternatives and similar repositories for Looking-to-Listen-at-the-Cocktail-Party

Users that are interested in Looking-to-Listen-at-the-Cocktail-Party are comparing it to the libraries listed below.

bill9800 / speech_separation
Includes some core functions and a model to handle speech separation
☆156 · Jun 24, 2021 · Updated 5 years ago
JusperLee / ExamOnline
This is a complete online exam system
☆10 · Dec 27, 2019 · Updated 6 years ago
JusperLee / Arxiv-New-Paper-Server
A service that automatically fetches the latest arXiv articles.
☆11 · Apr 29, 2020 · Updated 6 years ago
danmic / av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222 · Apr 16, 2023 · Updated 3 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
aispeech-lab / advr-avss
PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18 · Jul 11, 2022 · Updated 4 years ago
JusperLee / Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
☆468 · Feb 14, 2023 · Updated 3 years ago
JusperLee / DANet-For-Speech-Separation
PyTorch implementation of DANet for speech separation
☆21 · Jan 9, 2020 · Updated 6 years ago
facebookresearch / VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
☆250 · Jul 25, 2023 · Updated 3 years ago
JusperLee / Calculate-SNR-SDR
Script to calculate SNR and SDR using Python
☆93 · Jul 7, 2020 · Updated 6 years ago
JusperLee / UtterancePIT-Speech-Separation
Based on funcwj's uPIT: training code with multi-GPU support and a rebuilt Dataloader.
☆67 · Apr 14, 2020 · Updated 6 years ago
mayurnewase / looking-to-listen-at-cocktail-party
Looking to listen at cocktail party
☆36 · Mar 24, 2023 · Updated 3 years ago
JusperLee / Conv-TasNet
PyTorch implementation of Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
☆550 · May 26, 2023 · Updated 3 years ago
dr-pato / audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112 · Mar 19, 2024 · Updated 2 years ago
JusperLee / Deep-Clustering-for-Speech-Separation
PyTorch implementation of Deep Clustering: Discriminative Embeddings for Segmentation and Separation
☆133 · Jul 14, 2020 · Updated 6 years ago
meokz / looking-to-listen
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
☆173 · Nov 21, 2022 · Updated 3 years ago
JusperLee / awesome-speech-enhancement
speech enhancement\speech separation\sound source localization
☆15 · Apr 22, 2020 · Updated 6 years ago
JusperLee / Speech-Separation-Paper-Tutorial
Must-read papers for speech separation based on neural networks
☆952 · Aug 11, 2025 · Updated 11 months ago
zexupan / reentry
☆18 · Nov 22, 2024 · Updated last year
JusperLee / Deep-Encoder-Decoder-Conv-TasNet
A PyTorch implementation of "An Empirical Study of Conv-TasNet"
☆51 · Apr 20, 2020 · Updated 6 years ago
JusperLee / LRS3-For-Speech-Separation
Data generation script for the multi-modal speech separation task on the LRS3 dataset.
☆88 · Feb 2, 2024 · Updated 2 years ago
gemengtju / Tutorial_Separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly i…
☆484 · Jan 9, 2021 · Updated 5 years ago
uark-cviu / Right2Talk
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20 · Aug 2, 2021 · Updated 4 years ago
zexupan / USEV
☆14 · Jul 1, 2024 · Updated 2 years ago
DanielMengLiu / DeepLip
Deep-learning-based audio-visual lip biometrics
☆15 · May 9, 2023 · Updated 3 years ago
JorisCos / LibriMix
An open source dataset for source separation
☆502 · Feb 9, 2024 · Updated 2 years ago
afourast / avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆114 · Nov 16, 2020 · Updated 5 years ago
kaituoxu / Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…
☆769 · Apr 6, 2023 · Updated 3 years ago
Andong-Li-speech / MDNet
The implementation of MDNet, which is in submission to Interspeech2022
☆14 · May 1, 2022 · Updated 4 years ago
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
☆432 · Jul 6, 2023 · Updated 3 years ago
changil / avspeech-downloader
AVSpeech downloader
☆69 · Jan 30, 2019 · Updated 7 years ago
JusperLee / AFRCNN-For-Speech-Separation
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network
☆131 · Mar 28, 2022 · Updated 4 years ago
JuanFMontesinos / VoViT
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer
☆35 · Mar 18, 2023 · Updated 3 years ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26 · Apr 27, 2022 · Updated 4 years ago
lin9x / AV-Sepformer
☆65 · Jun 28, 2023 · Updated 3 years ago
WenzheLiu-Speech / awesome-speech-enhancement
speech enhancement\speech separation\sound source localization
☆1,244 · Nov 14, 2023 · Updated 2 years ago
yluo42 / TAC
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆311 · Jun 15, 2021 · Updated 5 years ago
mpc001 / end-to-end-lipreading
PyTorch code for End-to-End Audiovisual Speech Recognition
☆183 · Nov 18, 2022 · Updated 3 years ago
naplab / Conv-TasNet
☆337 · Feb 28, 2020 · Updated 6 years ago