LuluW8071 / Automatic-Speech-Recognition-with-PyTorchView external linksLinks
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
☆11Jan 23, 2025Updated last year
Alternatives and similar repositories for Automatic-Speech-Recognition-with-PyTorch
Users that are interested in Automatic-Speech-Recognition-with-PyTorch are comparing it to the libraries listed below
Sorting:
- ☆16Jan 11, 2026Updated last month
- A toolkit for researchers in the multimodal sound separation.☆16Oct 20, 2023Updated 2 years ago
- ☆21Jul 16, 2025Updated 7 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆42Sep 5, 2025Updated 5 months ago
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆36Aug 7, 2024Updated last year
- ☆164Oct 1, 2025Updated 4 months ago
- From Python basics to Machine Learning and PyTorch Deep Learning - one day at a time, explore it all☆10May 25, 2025Updated 8 months ago
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 4 months ago
- The Ecoacoustic Dataset from Arctic North Slope Alaska☆11May 29, 2025Updated 8 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆19Aug 25, 2025Updated 5 months ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- ☆13Dec 1, 2025Updated 2 months ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- ☆11May 9, 2023Updated 2 years ago
- ☆24Aug 29, 2025Updated 5 months ago
- 记录关于AEC的论文和代码、博客以及 相关资料☆15Jul 26, 2022Updated 3 years ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- The A2C Reinforcement Learning Algorithm in Pytorch☆16May 13, 2024Updated last year
- ☆14Jan 5, 2022Updated 4 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆14Dec 20, 2022Updated 3 years ago
- The rag pipeline for optimizing dynamic data editing.☆20Oct 30, 2025Updated 3 months ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Jul 16, 2022Updated 3 years ago
- Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、ag…☆47Jul 17, 2024Updated last year
- https://wavelandspeech.github.io/☆10Jan 12, 2024Updated 2 years ago
- Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation☆28Dec 10, 2025Updated 2 months ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Jun 17, 2021Updated 4 years ago
- A Unified Deep Learning Framework for ssTEM Image Restoration☆12Jul 27, 2022Updated 3 years ago
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Image-source method for room acoustics☆14Feb 5, 2020Updated 6 years ago
- PyTorch implementation of Continuous Speech Separation☆12Oct 5, 2022Updated 3 years ago
- ☆10Oct 24, 2024Updated last year