PyTorch implementation of DeepSpeech 2.0
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for DeepSpeech-pytorch
Users that are interested in DeepSpeech-pytorch are comparing it to the libraries listed below.
- Code for the paper "Bag of features for voice anti-spoofing"☆13Jul 6, 2023Updated 2 years ago
- OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library☆12Jul 13, 2017Updated 8 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Aug 23, 2019Updated 6 years ago
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- A PyTorch framework for streaming and non-streaming automatic speech recognition, compatible with both online and offline recognition; currently supports the Conformer, Squeezeformer, and DeepSpeech2 models and multiple data augmentation methods.☆722Dec 17, 2025Updated 4 months ago
- Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.☆19Oct 28, 2025Updated 5 months ago
- Meta-Learning for End-to-End ASR☆10Aug 8, 2020Updated 5 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Feb 17, 2025Updated last year
- Voice Activity Detection System☆21Jun 9, 2015Updated 10 years ago
- Cross-platform network library using the epoll and IOCP models☆11May 13, 2018Updated 7 years ago
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- ☆13Jun 20, 2019Updated 6 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Applies an improved Transformer model to multivariate time series classification tasks☆11Feb 8, 2021Updated 5 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- Conditioned U-Net for Music Source Separation☆20May 15, 2021Updated 4 years ago
- Collection of Grapheme-to-Phoneme (G2P) resources☆20Jul 12, 2018Updated 7 years ago
- ☆16Oct 7, 2022Updated 3 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- Deploy deep learning models on different hardware and frameworks. (TensorRT/ONNX/MNN/RKNN)☆13Jan 2, 2022Updated 4 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Some code for "Stealing Part of a Production Language Model"☆22Mar 20, 2024Updated 2 years ago
- ☆13Feb 3, 2022Updated 4 years ago
- A simple thread pool implemented in C++ with task queue support; concrete tasks inherit from taskbase.☆12Apr 15, 2015Updated 11 years ago
- Kubernetes operator that updates automatically existing deployment's images to the latest version, in a customized way.☆13Aug 31, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- `junior must know his place` team solution☆10Aug 15, 2023Updated 2 years ago
- Code for the NeurIPS 2019 submission: "Improving Black-box Adversarial Attacks with a Transfer-based Prior".☆15May 6, 2020Updated 5 years ago
- This project is based on the PaddleDetection object detection toolkit, uses the 1.3M ultra-lightweight PPYOLO tiny model for development, and deploys on Windows.☆11May 30, 2021Updated 4 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- Yolo(including yolov1 yolov2 yolov3)running on caffe windows. Anyone that is not familiar with linux can use this project to learn caffe …☆18Jun 15, 2018Updated 7 years ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆17May 16, 2025Updated 11 months ago
- Hpyformer base FunASR☆30Nov 5, 2024Updated last year
- Simple API for CosyVoice speech synthesis☆14Nov 1, 2024Updated last year
- A simple API version of FunASR speech-to-text (FunASR + FastAPI), easy to deploy on a server☆13Aug 10, 2024Updated last year