PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 6 years ago
Alternatives and similar repositories for Robust_e2e_gan
Users that are interested in Robust_e2e_gan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆17Nov 22, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.☆23Jul 24, 2020Updated 5 years ago
- Official PyTorch implementation of the paper: "Deep Audio Waveform Prior" (Interspeech 2022) https://arxiv.org/abs/2207.10441☆11Oct 25, 2022Updated 3 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆55Dec 6, 2023Updated 2 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- ☆55Jun 15, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- WeNet 实战课程作业☆20Oct 7, 2022Updated 3 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Apr 22, 2019Updated 6 years ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- MFCC implementation with detailed comments.☆17Nov 26, 2020Updated 5 years ago
- Technologies for binaurally reproducing ultrasonic and underwater sound sources, such that they are both audible and localisable by a lis…☆21Jan 13, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Jun 23, 2022Updated 3 years ago
- Voice activity detection (VAD) library and Go bindings based on WebRTC's VAD engine☆11Mar 1, 2018Updated 8 years ago
- Implementation of Harmonic Convolution by Harmonic Lowering☆17Nov 11, 2020Updated 5 years ago
- ☆50Dec 26, 2020Updated 5 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated 2 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- Tensorflow training scripts for depthwise separable convolutional neural networks for keyword spotting, and C++ code for deployment.☆41Apr 2, 2020Updated 5 years ago
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆346Sep 5, 2020Updated 5 years ago
- Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."☆44Apr 16, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- Deep Learning Based Monaural Speech Dereverberation Models: Hope We Can Get Better Performance of Dereverberation☆20Mar 16, 2022Updated 4 years ago
- pYIN pitch detection implementation with librosa and python 3☆14Jul 16, 2019Updated 6 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Gammatone feature for robust speech recognition☆14Aug 1, 2016Updated 9 years ago
- 基于深度学习的语音增强、去混响☆100Jan 30, 2024Updated 2 years ago