☆16Dec 27, 2023Updated 2 years ago
Alternatives and similar repositories for realtime_nkf_aec
Users that are interested in realtime_nkf_aec are comparing it to the libraries listed below
- ☆21Jul 29, 2024Updated last year
- Acoustic Echo Cancellation with Neural Kalman Filtering☆348Feb 21, 2023Updated 3 years ago
- An unofficial implementation of MFNet, from the paper "A Mask Free Neural Network for Monaural Speech Enhancement"☆13Dec 20, 2024Updated last year
- Fine-tune the chain model based on the CVTE open-source model without training any GMM for frame alignment☆13Aug 6, 2020Updated 5 years ago
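- 2014 TI Cup (Problem D): an audio power amplifier with howling detection and suppression. Voltage and current acquisition implemented on an STM32 microcontroller, with an oscilloscope display on an LCD screen. A/D conversion, with a fast Fourier transform of the sampled data to generate a spectrum plot. Howling detection.☆22Mar 3, 2019Updated 7 years ago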
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆25Nov 12, 2025Updated 3 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- ☆36Jan 6, 2026Updated 2 months ago
- ☆10Feb 19, 2020Updated 6 years ago
- A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…☆15Dec 3, 2025Updated 3 months ago
- ☆46Jan 14, 2025Updated last year
- An STFT/iSTFT written in PyTorch using 1D Convolutions☆32Jul 9, 2024Updated last year
- Baseline code for deep-learning-based acoustic echo cancellation☆158May 21, 2021Updated 4 years ago
- Whisper finetuning☆16Apr 9, 2025Updated 11 months ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…☆16Sep 1, 2024Updated last year
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- Russian phonetical transcription☆11Nov 19, 2025Updated 3 months ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- ☆13Oct 9, 2025Updated 5 months ago
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆27Feb 13, 2026Updated 3 weeks ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Vosk Speech Recognition API) and TRANSLATED SUBTITLE FILE…☆11May 5, 2024Updated last year
- superfast text to speech in any voice☆61Feb 16, 2026Updated 3 weeks ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆42Feb 4, 2026Updated last month
- ☆48Feb 14, 2025Updated last year
- ☆11Nov 3, 2023Updated 2 years ago
- text to speech☆10Mar 19, 2024Updated last year
- ☆13Jan 2, 2025Updated last year
- ☆11Jun 14, 2024Updated last year
- Example python scripts to evaluate various ASR methods☆11Dec 22, 2021Updated 4 years ago
- For personal use: speech-to-text uses sencevoice, the LLM part is based on ollama API calls, text-to-speech uses cosyvoice, and real-time voice input follows https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year
- Docker for building an environment for Dutch online and offline ASR.☆12Feb 2, 2021Updated 5 years ago