Pytorch implementation for DeepSpeech 2.0
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for DeepSpeech-pytorch
Users that are interested in DeepSpeech-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the paper "Bag of features for voice anti-spoofing"☆13Jul 6, 2023Updated 2 years ago
- Listen, Attend and spell model for E2E ASR. Implementation in Pytorch☆42Jun 22, 2022Updated 3 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Aug 23, 2019Updated 6 years ago
- ☆17Dec 13, 2019Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆724Dec 17, 2025Updated 3 months ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆16Apr 4, 2019Updated 6 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 3 years ago
- the MEX wrapper for PESQ (Perceptual Evaluation of Speech Quality)☆15May 10, 2019Updated 6 years ago
- Tesseract4 finetuned traineddata for Central Kurdish/Sorani☆11Apr 18, 2020Updated 5 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 4 months ago
- Meta-Learning for End-to-End ASR☆10Aug 8, 2020Updated 5 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Feb 17, 2025Updated last year
- Voice Activity Detection System☆21Jun 9, 2015Updated 10 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- An minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer☆83Apr 29, 2024Updated last year
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- 使用改良的Transformer模型应用于多维时间序列的分类任务上☆11Feb 8, 2021Updated 5 years ago
- ☆16Oct 7, 2022Updated 3 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- Code for "Prior Convictions: Black-box Adversarial Attacks with Bandits and Priors"☆13Sep 27, 2018Updated 7 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 用C++实现的一个简单的线程池,支持任务队列,实际任务继承自taskbase。☆12Apr 15, 2015Updated 10 years ago
- ☆12Feb 3, 2022Updated 4 years ago
- Kubernetes operator that updates automatically existing deployment's images to the latest version, in a customized way.☆13Aug 31, 2022Updated 3 years ago
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆69Nov 3, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last week
- Code for the NeurIPS 2019 submission: "Improving Black-box Adversarial Attacks with a Transfer-based Prior".☆15May 6, 2020Updated 5 years ago
- 本项目基于PaddleDetection目标检测开发套件,选取1.3M超轻量PPYOLO tiny进行项目开发,并部署于windows端。☆11May 30, 2021Updated 4 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 简单的Modern C++ Socket跨平台可伸缩实现☆21Apr 22, 2021Updated 4 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- Sentiment Analysis via RNN, RNTN. Based on Stanford's Sentiment Analysis page.☆10Feb 5, 2015Updated 11 years ago
- use TCN and Transformer model for "Hourly Energy Consumption" data☆13Apr 29, 2022Updated 3 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆16May 16, 2025Updated 10 months ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- keras搭建卷积神经网络模型☆11Feb 6, 2018Updated 8 years ago