PyTorch implementation for DeepSpeech 2.0
☆31 · Jul 25, 2024 · Updated last year
Alternatives and similar repositories for DeepSpeech-pytorch
Users who are interested in DeepSpeech-pytorch are comparing it to the libraries listed below.
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016) ☆29 · Mar 5, 2021 · Updated 5 years ago
- Code for the paper "Bag of features for voice anti-spoofing" ☆13 · Jul 6, 2023 · Updated 2 years ago
- CHiME-5 Baseline Array Synchronisation ☆12 · Sep 24, 2018 · Updated 7 years ago
- OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library ☆12 · Jul 13, 2017 · Updated 8 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell). ☆11 · Aug 23, 2019 · Updated 6 years ago
- ☆17 · Dec 13, 2019 · Updated 6 years ago
- PyTorch implementation of automatic speech recognition models. ☆38 · Jan 10, 2021 · Updated 5 years ago
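- A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition; currently supports the Conformer, Squeezeformer, and DeepSpeech2 models and multiple data augmentation methods. ☆723 · Dec 17, 2025 · Updated 5 months ago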
- Implementations for the master's thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB. ☆16 · Apr 4, 2019 · Updated 7 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21 ☆17 · May 14, 2022 · Updated 4 years ago
- The MEX wrapper for PESQ (Perceptual Evaluation of Speech Quality) ☆15 · May 10, 2019 · Updated 7 years ago
- Tesseract4 finetuned traineddata for Central Kurdish/Sorani ☆11 · Apr 18, 2020 · Updated 6 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification. ☆19 · Oct 28, 2025 · Updated 7 months ago
- Meta-Learning for End-to-End ASR ☆10 · Aug 8, 2020 · Updated 5 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly ☆11 · Feb 17, 2025 · Updated last year
- Voice Activity Detection System ☆21 · Jun 9, 2015 · Updated 10 years ago
- Cross-platform networking library using the epoll and IOCP models ☆11 · May 13, 2018 · Updated 8 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation ☆41 · Sep 1, 2023 · Updated 2 years ago
- ☆13 · Jun 20, 2019 · Updated 6 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss ☆11 · Mar 1, 2021 · Updated 5 years ago
- A minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer ☆85 · Apr 29, 2024 · Updated 2 years ago
- Conditioned U-Net for Music Source Separation ☆20 · May 15, 2021 · Updated 5 years ago
- ☆16 · Sep 24, 2024 · Updated last year
- ☆16 · Oct 7, 2022 · Updated 3 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020 ☆22 · Oct 29, 2020 · Updated 5 years ago
- Code for "Prior Convictions: Black-box Adversarial Attacks with Bandits and Priors" ☆13 · Sep 27, 2018 · Updated 7 years ago
- Expected edit distance implementation using OpenFst tools ☆11 · May 13, 2015 · Updated 11 years ago
- Updated version of the RUBiS benchmark (http://rubis.ow2.org/) ☆12 · Jun 20, 2017 · Updated 8 years ago
- One command to start a streaming ASR server. ☆12 · Oct 2, 2024 · Updated last year
- Some code for "Stealing Part of a Production Language Model" ☆22 · Mar 20, 2024 · Updated 2 years ago
- A simple thread pool implemented in C++ with task queue support; concrete tasks inherit from taskbase. ☆12 · Apr 15, 2015 · Updated 11 years ago
- Kubernetes operator that automatically updates existing deployments' images to the latest version, in a customized way. ☆13 · Aug 31, 2022 · Updated 3 years ago
- ASRDeepspeech x Sakura-ML (English/Japanese) with the DeepSpeech2 model in PyTorch, with support from Zakuro AI. ☆69 · Updated this week
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for TensorFlow. ☆12 · Jul 5, 2021 · Updated 4 years ago
- Compute WER and SER for speech recognition evaluation ☆26 · Mar 18, 2026 · Updated 2 months ago
- Code for the NeurIPS 2019 submission: "Improving Black-box Adversarial Attacks with a Transfer-based Prior". ☆15 · May 6, 2020 · Updated 6 years ago
- FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version ☆18 · Jun 27, 2025 · Updated 11 months ago
- Helm chart and Terraform modules for the JARVICE XE Hybrid Cloud HPC platform ☆17 · May 22, 2026 · Updated last week
- YOLO (including YOLOv1, YOLOv2, YOLOv3) running on Caffe for Windows. Anyone who is not familiar with Linux can use this project to learn Caffe … ☆18 · Jun 15, 2018 · Updated 7 years ago