Pytorch implementation for DeepSpeech 2.0
☆32Jul 25, 2024Updated last year
Alternatives and similar repositories for DeepSpeech-pytorch
Users that are interested in DeepSpeech-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆28Mar 5, 2021Updated 5 years ago
- Code for the paper "Bag of features for voice anti-spoofing"☆13Jul 6, 2023Updated 2 years ago
- Listen, Attend and spell model for E2E ASR. Implementation in Pytorch☆42Jun 22, 2022Updated 3 years ago
- OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library☆12Jul 13, 2017Updated 8 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Aug 23, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Dec 13, 2019Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆722Dec 17, 2025Updated 4 months ago
- Implementations for master thesis "Musical Instrument Recognition in Multi-Instrument Audio Contexts" with MedleyDB.☆16Apr 4, 2019Updated 7 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆17May 14, 2022Updated 3 years ago
- the MEX wrapper for PESQ (Perceptual Evaluation of Speech Quality)☆15May 10, 2019Updated 6 years ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 6 months ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Feb 17, 2025Updated last year
- 跨平台网络库,使用epoll和iocp模型☆11May 13, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- 基于DeepConvLSTM的传感器信号分类☆11May 15, 2018Updated 7 years ago
- ☆13Jun 20, 2019Updated 6 years ago
- Velocity Kernel for the Samsung Galaxy S8/S8+ (dreamlte/dream2lte). (discontinued)☆10May 30, 2019Updated 6 years ago
- ☆16Jan 14, 2025Updated last year
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- An minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer☆85Apr 29, 2024Updated 2 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo contains the implementation of deep reinforcement learning (DRL) algorithms for virtual machine rescheduling in data centers.☆12Dec 2, 2022Updated 3 years ago
- Deploy deep learning model on difference hardware and framework. (TensorRT/ONNX/MNN/RKNN)☆13Jan 2, 2022Updated 4 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Some code for "Stealing Part of a Production Language Model"☆22Mar 20, 2024Updated 2 years ago
- ☆12May 18, 2020Updated 5 years ago
- ☆54Dec 13, 2022Updated 3 years ago
- 用C++实现的一个简单的线程池,支持任务队列,实际任务继承自taskbase。☆12Apr 15, 2015Updated 11 years ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- Kubernetes operator that updates automatically existing deployment's images to the latest version, in a customized way.☆13Aug 31, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.☆69Updated this week
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- `junior must know his place` team solution☆10Aug 15, 2023Updated 2 years ago
- Code for the NeurIPS 2019 submission: "Improving Black-box Adversarial Attacks with a Transfer-based Prior".☆15May 6, 2020Updated 6 years ago
- 本项目基于PaddleDetection目标检测开发套件,选取1.3M超轻量PPYOLO tiny进行项目开发,并部署于windows端。☆11May 30, 2021Updated 4 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 10 months ago