PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 6 years ago
Alternatives and similar repositories for Robust_e2e_gan
Users that are interested in Robust_e2e_gan are comparing it to the libraries listed below.
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- Epoch-synchronous overlap-add (ESOLA) for time- and pitch-scale modification of speech signals.☆23Jul 24, 2020Updated 5 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆56Dec 6, 2023Updated 2 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- ☆55Jun 15, 2020Updated 5 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- Keras Functional API for multiple inputs and mixed data☆11Feb 18, 2019Updated 7 years ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- WeNet hands-on course assignments☆20Oct 7, 2022Updated 3 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Apr 22, 2019Updated 7 years ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Jun 23, 2022Updated 3 years ago
- Implementation of Harmonic Convolution by Harmonic Lowering☆17Nov 11, 2020Updated 5 years ago
- ☆50Dec 26, 2020Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Tensorflow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment.☆41Apr 2, 2020Updated 6 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆347Sep 5, 2020Updated 5 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆41Mar 13, 2024Updated 2 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 7 years ago
- implementing beamforming algorithm in C++☆11Jan 9, 2020Updated 6 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 6 years ago
- ASR, End-to-End, end2end, Speech Recognition, end-to-end speech recognition☆12Oct 25, 2020Updated 5 years ago
- Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation☆20Mar 16, 2022Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 5 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Nov 25, 2016Updated 9 years ago
- Deep-learning-based speech enhancement and dereverberation☆101Jan 30, 2024Updated 2 years ago
- Implementation of a Spiking Neural Network in Tensorflow.☆24May 13, 2018Updated 7 years ago
- A Fourier-based audio synthesiser written in MATLAB as a university project.☆12Jan 19, 2019Updated 7 years ago
- Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM☆122Nov 20, 2019Updated 6 years ago
- This repository is a WebRTC AGC module demo.☆12Jan 23, 2019Updated 7 years ago