R1ckShi / SeACo-Paraformer
[ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.
☆38 Updated last year
Alternatives and similar repositories for SeACo-Paraformer
Users who are interested in SeACo-Paraformer are comparing it to the repositories listed below.
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data ☆34 Updated 2 years ago
- Speech samples and code of BEdit-TTS ☆34 Updated 2 years ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS ☆50 Updated last year
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po… ☆24 Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆96 Updated last year
- faster inference ☆28 Updated last year
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》 ☆77 Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆40 Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ☆103 Updated last year
- ☆29 Updated 11 months ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A… ☆74 Updated 7 months ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English. ☆63 Updated 4 months ago
- Chinese Text Normalization and Dataset ☆90 Updated 3 years ago
- ☆78 Updated 7 months ago
- In-car multi-channel speech transcription system of AISHELL-5. ☆39 Updated 7 months ago
- Production-first, NN-based on-device signal processing toolkit. ☆65 Updated 2 years ago
- Official repository for the WenetSpeech-Chuan dataset. ☆137 Updated 2 months ago
- A CTC decoder for both online and offline ASR models ☆66 Updated 2 years ago
- ☆13 Updated 2 years ago
- ☆32 Updated 3 years ago
- One command to build TLG.fst for WeNet. ☆30 Updated 3 years ago
- ☆64 Updated 3 years ago
- ☆46 Updated 2 years ago
- A streaming audio reader, processor, and writer built on top of soundfile and PyAV (bindings for FFmpeg) ☆37 Updated last week
- Streaming Audio Transformers for online audio tagging ☆50 Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge. ☆54 Updated 2 years ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels ☆42 Updated last year
- WeNet online decode demo ☆31 Updated 4 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT ☆74 Updated 3 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆42 Updated 2 years ago