☆69Jul 17, 2024Updated last year
Alternatives and similar repositories for telespeech-asr-python
Users that are interested in telespeech-asr-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆839Jun 7, 2024Updated last year
- The repo provides information about KeSpeech dataset.☆174Oct 13, 2022Updated 3 years ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)☆15Oct 22, 2025Updated 5 months ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 10 months ago
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- 几种VAD算法的测评☆25Jul 31, 2020Updated 5 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- Keyword spotting, Speech wake_up, by pytorch, DNN, CNN, TDNN, DFSMN, LSTM☆54Mar 15, 2022Updated 4 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- 求取语音的MFCC参数和GFCC参数,可用于语音信号特征提取☆10Jul 19, 2021Updated 4 years ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆15Jul 4, 2024Updated last year
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆16Dec 1, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆27Nov 20, 2024Updated last year
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆26Apr 12, 2024Updated last year
- Colab notebooks for Next-gen Kaldi☆30Oct 12, 2025Updated 5 months ago
- ☆23Oct 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Test Framework for few-shot open set KWS☆42Nov 8, 2024Updated last year
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- noise reduction☆17Jul 3, 2024Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- ☆28Oct 7, 2025Updated 5 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- TTS inference in C++ based on TFlite model☆20Jan 18, 2021Updated 5 years ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆32Apr 26, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Sep 25, 2024Updated last year
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆214Aug 7, 2025Updated 7 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆53Jan 26, 2026Updated 2 months ago
- faster inference☆28Jan 20, 2025Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 5 months ago