pengzhendong / asr-decoder
CTC decoder with hotwords for ASR.
☆19 Updated 3 weeks ago
Alternatives and similar repositories for asr-decoder:
Users who are interested in asr-decoder are comparing it to the repositories listed below.
- faster inference ☆28 Updated 3 months ago
- (WIP) long-form speech generation ☆31 Updated last month
- ☆19 Updated 6 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX. ☆12 Updated 4 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS ☆49 Updated 9 months ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English. ☆11 Updated 4 months ago
- ☆26 Updated 3 months ago
- noise reduction ☆17 Updated 10 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆67 Updated 6 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement ☆14 Updated 2 months ago
- ☆28 Updated last week
- Utilizes ONNX Runtime for audio denoising. ☆45 Updated this week
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆24 Updated last month
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer. ☆29 Updated last year
- Self-supervised Generative LM-based Voice Conversion ☆27 Updated last week
- Just another FastSpeech 2 but cleaner code :) ☆26 Updated 10 months ago
- ☆18 Updated last year
- One command to start a streaming ASR server. ☆12 Updated 7 months ago
- ☆20 Updated 6 months ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5 ☆27 Updated last month
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model ☆23 Updated this week
- Streaming Vocos ☆24 Updated 3 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆28 Updated last year
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model ☆12 Updated last month
- real-time speech enhancement ☆15 Updated last year
- Huawei Grad-TTS for Chinese ☆50 Updated last year
- ☆64 Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated 11 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud… ☆93 Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆27 Updated 9 months ago