pengchengguo/espnet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pengchengguo/espnet)

pengchengguo / espnet

End-to-End Speech Processing Toolkit

☆11

Alternatives and similar repositories for espnet

Users that are interested in espnet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Andong-Li-speech / MDNet
View on GitHub
The implementation of MDNet, which is in submission to Interspeech2022
☆14May 1, 2022Updated 4 years ago
JonathanDZ / TF-FaSNet
View on GitHub
☆24Feb 28, 2023Updated 3 years ago
Okrio / deepvqe
View on GitHub
☆14Oct 12, 2023Updated 2 years ago
onolab-tmu / code_2020ICASSP_five
View on GitHub
Fast Independent Vector Extraction: Code and data to reproduce the results from the paper.
☆25May 7, 2020Updated 6 years ago
YosukeSugiura / Wave-U-Net-for-Speech-Enhancement-NNabla
View on GitHub
Wave U Net (NNabla)
☆13Jul 1, 2020Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
merlresearch / reverberation-as-supervision
View on GitHub
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15Aug 1, 2024Updated last year
fgnt / graph_pit
View on GitHub
☆42Oct 14, 2022Updated 3 years ago
l3das / L3DAS22
View on GitHub
☆57Jun 4, 2022Updated 4 years ago
hustvl / RND-SCI
View on GitHub
A Range-Null Space Decomposition Approach for Fast and Flexible Spectral Compressive Imaging
☆11May 18, 2023Updated 3 years ago
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
jx1100370217 / ASR_dosmono
View on GitHub
Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)
☆12Aug 9, 2018Updated 7 years ago
jordi-adell / mcarray
View on GitHub
Library for real-time digital signal processing of microphone array signals. It is based on DSPONE adn WIPP and can perform binarula loca…
☆16Mar 23, 2017Updated 9 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
fgnt / speaker_reassignment
View on GitHub
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14Feb 5, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Andong-Li-speech / Neural-Vocoders-as-Speech-Enhancers
View on GitHub
☆52Sep 10, 2024Updated last year
Happenmass / stable-diffusion-webui-tensorRT-sdxl
View on GitHub
Stable-diffusion-WebUI extensions, which enable tensorrt accelerated Unet for SDXL base model
☆12Oct 18, 2023Updated 2 years ago
fakufaku / auxiva-ipa
View on GitHub
Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.
☆36Mar 22, 2021Updated 5 years ago
yuguochencuc / CinCGAN-SE
View on GitHub
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10Jan 24, 2022Updated 4 years ago
ShlezingerLab / deepsic-official
View on GitHub
☆12May 10, 2023Updated 3 years ago
sp-uhh / uncertainty-SE
View on GitHub
☆17Mar 30, 2023Updated 3 years ago
alibabasglab / D2Former
View on GitHub
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Ma…
☆46Sep 6, 2023Updated 2 years ago
AmirAbaskohi / Automatic-Speech-recognition-for-Speech-Assessment-of-Persian-Preschool-Children
View on GitHub
Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The …
☆20May 24, 2023Updated 3 years ago
mobvoi / lstm_ctc
View on GitHub
LSTM CTC End2End Speech Recognition.
☆38Apr 2, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
ssprl / Real-time-Blind-source-separation-using-IVA
View on GitHub
☆16Apr 24, 2021Updated 5 years ago
ehabets / INF-Generator
View on GitHub
Generating sensor signals in isotropic noise fields (MATLAB)
☆47Mar 17, 2023Updated 3 years ago
echocatzh / SPEEX-AEC-python
View on GitHub
Multi-Delay Filter( or Partioned-block based Frequency-domain Adaptive Filter) impl with python.
☆32Oct 12, 2021Updated 4 years ago
microsoft / NOTSOFAR1-Challenge
View on GitHub
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆65Feb 12, 2025Updated last year
yluo42 / GC3
View on GitHub
☆51May 16, 2021Updated 5 years ago
vkothapally / JAECBF
View on GitHub
☆62Apr 11, 2022Updated 4 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
ichi131 / Direction-based-BiTSE
View on GitHub
☆15Sep 19, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
AIFSH / SenseVoice-ComfyUI
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
TzuchengChang / NASS
View on GitHub
Noise-Aware Speech Separation with Contrastive Learning
☆21Apr 25, 2024Updated 2 years ago
felixfuyihui / Optimize-FixBF-Weight
View on GitHub
☆17Jun 3, 2020Updated 6 years ago
XZWY / SpatialCodec
View on GitHub
Implementation of SpatialCodec.
☆71Sep 23, 2023Updated 2 years ago
Andong-Li-speech / TaEr
View on GitHub
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14Nov 25, 2022Updated 3 years ago
fakufaku / create_wsj1_2345_db
View on GitHub
Collection of scripts to create a dataset of noisy multi-channel reverberant mixtures based on wsj1 and CHiME3 datasets.
☆15Dec 6, 2021Updated 4 years ago
knightrain / Mandarin-TTS
View on GitHub
A simple TTS(text-to-speech) engine for Chinese mandarin
☆21Feb 20, 2012Updated 14 years ago