Papers of ASR, Tools of ASR
☆41Feb 14, 2025Updated last year
Alternatives and similar repositories for asr_nlp_paper_code
Users that are interested in asr_nlp_paper_code are comparing it to the libraries listed below
Sorting:
- ☆11Oct 20, 2022Updated 3 years ago
- The dataset of Speech Recognition☆454Jan 4, 2026Updated 2 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- Rank 7th/1817 in the 2018 iFLYTEK AI Developer Challenge with acc 0.82 for the ten Chinese dialects classification task, this code was p…☆13Nov 19, 2023Updated 2 years ago
- PyTorch re-implementation of Speech-Transformer☆102Nov 19, 2021Updated 4 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- ☆41May 15, 2023Updated 2 years ago
- 分享在深蓝学院《语音识别:从入门到精通》第一期课程学习过程中完成的课后作业,供参考。☆21Sep 13, 2020Updated 5 years ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆20May 24, 2023Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Tensorflow version of DFSMN☆49Jul 17, 2018Updated 7 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- ☆22Nov 25, 2025Updated 3 months ago
- ☆41Oct 16, 2025Updated 5 months ago
- Collection of awesome Continual Test-Time Adaptation methods☆24Jun 4, 2024Updated last year
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- ☆38Jun 5, 2023Updated 2 years ago
- ☆67Apr 3, 2023Updated 2 years ago
- ☆32Oct 28, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- ☆18Feb 18, 2026Updated last month
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆42Jan 17, 2025Updated last year
- A neural speech codec based on discrete WavLM representations☆25Aug 28, 2024Updated last year
- 语音方向实验室/公司/资源/实习等,欢迎推荐或自荐☆599Nov 13, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- ☆128Mar 21, 2023Updated 3 years ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆153Mar 3, 2026Updated 2 weeks ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- A Generative Adversarial Network for Shakuhachi Music☆14Jul 2, 2019Updated 6 years ago
- 一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目,CPU上的实时率(RTF)小于0.1☆474Mar 13, 2025Updated last year
- ☆10Dec 16, 2018Updated 7 years ago