taras-sereda / deep-learning-for-audioLinks
☆17Updated last year
Alternatives and similar repositories for deep-learning-for-audio
Users that are interested in deep-learning-for-audio are comparing it to the libraries listed below
Sorting:
- ☆21Updated 5 years ago
- Speech analytics package for call-center☆23Updated 4 years ago
- Accentor and transcriptor for Russian language☆123Updated 3 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- The VoxTube dataset official repository☆69Updated last year
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Updated 2 years ago
- ☆26Updated 3 weeks ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- ☆56Updated 2 years ago
- ☆37Updated last month
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆36Updated 10 months ago
- ☆13Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Experiments with grapheme2phoneme for Russian based on the artificial neural networks☆20Updated 4 years ago
- Home of Projector's "Data Science. Natural Language Processing" 2020 Edition☆19Updated last year
- Dictionary of obscene words for Ukrainian language☆18Updated last month
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- Dictionary of word stresses in the Ukrainian language 🇺🇦☆20Updated 8 months ago
- radiomixer☆14Updated 3 years ago
- nlp workshop at datafest siberia 2019☆22Updated 2 years ago
- Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks☆119Updated 4 years ago
- ☆163Updated 2 years ago
- ☆38Updated 3 years ago
- ☆23Updated 3 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆31Updated 2 years ago
- ☆46Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated last month