jiwidi / DeepSpeech-pytorch
PyTorch implementation of DeepSpeech 2.0
☆31Jul 25, 2024Updated last year
Alternatives and similar repositories for DeepSpeech-pytorch
Users who are interested in DeepSpeech-pytorch compare it to the libraries listed below.
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Mar 5, 2021Updated 4 years ago
- ☆16Oct 7, 2022Updated 3 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- ☆10Jul 29, 2025Updated 6 months ago
- OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library☆12Jul 13, 2017Updated 8 years ago
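- A PyTorch framework for streaming and non-streaming automatic speech recognition, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with a variety of data augmentation methods.☆721Dec 17, 2025Updated last month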
- [INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation☆41Sep 1, 2023Updated 2 years ago
- Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)☆11Nov 29, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Azuki is a free text editor engine written in C# 2.0. This is an extended version created by forking the original on GitHub.☆11Feb 26, 2023Updated 2 years ago
- Make N-Gram for Uyghur language☆15Dec 24, 2020Updated 5 years ago
- Expected edit distance implementation using OpenFst tools☆11May 13, 2015Updated 10 years ago
- BigBlueButton API for .NET☆11Sep 12, 2022Updated 3 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- TensorFlowLiteNet allows using TensorFlowLite from C#.☆11Apr 14, 2021Updated 4 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Meta-Learning for End-to-End ASR☆10Aug 8, 2020Updated 5 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- A PyTorch implementation of SimSiam based on CVPR 2021 paper "Exploring Simple Siamese Representation Learning"☆12Mar 23, 2021Updated 4 years ago
- Speech to text transcription using RNN (Listen, Attend and Spell).☆11Aug 23, 2019Updated 6 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆16Jun 27, 2025Updated 7 months ago
- ☆14Mar 28, 2018Updated 7 years ago
- Python wrapper for the EDDL library.☆13Jun 14, 2022Updated 3 years ago
- For personal use. Speech-to-text uses SenseVoice, the LLM part calls the Ollama API, text-to-speech uses CosyVoice, and real-time voice input is based on https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago
- Framework for Deep Speech Recognition☆11Nov 22, 2022Updated 3 years ago
- Examples for the VSRL-Framework☆11Sep 17, 2025Updated 4 months ago
- A full-text error corrector for English based on transformers and deep learning☆10Jan 8, 2023Updated 3 years ago
- speech recognition based on deep neural network/hidden markov model☆10Jun 3, 2020Updated 5 years ago
- ☆11May 18, 2022Updated 3 years ago
- A tool that assists in creating labels for NNSVS training data.☆10Apr 5, 2023Updated 2 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Out-of-box mtcnn detector using onnxruntime or opencv☆11Apr 21, 2021Updated 4 years ago
- Sample code to record audio and save to Server Side Blazor using MediaRecorder API and Recorder.js☆13Dec 26, 2020Updated 5 years ago
- ☆10Oct 16, 2019Updated 6 years ago
- The material is covered in my YouTube playlist "Data Wrangling with Python" available on YUNIKARN.☆15Dec 9, 2025Updated 2 months ago