☆69Jul 17, 2024Updated last year
Alternatives and similar repositories for telespeech-asr-python
Users that are interested in telespeech-asr-python are comparing it to the libraries listed below
Sorting:
- ☆835Jun 7, 2024Updated last year
- The repo provides information about KeSpeech dataset.☆172Oct 13, 2022Updated 3 years ago
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆30Oct 12, 2025Updated 4 months ago
- noise reduction☆17Jul 3, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM☆53Mar 15, 2022Updated 3 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 9 months ago
- ☆23Oct 17, 2024Updated last year
- windows端翻译软件。提供划词翻译、截图翻译、ai翻译等功能☆12Apr 24, 2025Updated 10 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 4 months ago
- ☆13Sep 25, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Dec 15, 2022Updated 3 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- 求取语音的MFCC参数和GFCC参数,可用于语音信号特征提取☆10Jul 19, 2021Updated 4 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- ☆28Oct 7, 2025Updated 5 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 6 months ago
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆54Dec 23, 2024Updated last year
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago