In this work I investigate the speech command task by developing and analyzing deep learning models. State-of-the-art approaches use convolutional neural networks (CNNs) because they are naturally suited to learning correlated representations, such as those found in speech. In particular, I develop different CNNs trained on the Google Speech Command Dataset…
☆20Jul 18, 2018Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below.
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆26Aug 29, 2017Updated 8 years ago
- a very simple vocal tract model, few tube model. generate vowel sound by it☆18Jul 9, 2023Updated 2 years ago
- This was for a DevICT (http://devict.org) presentation given July, 2020.☆13Jul 9, 2020Updated 5 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- ☆19Nov 20, 2021Updated 4 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Jan 19, 2018Updated 8 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.☆11Apr 14, 2021Updated 5 years ago
- Privacy-preserving Voice Analysis via Disentangled Representations☆12Aug 30, 2021Updated 4 years ago
- try implement go version sqlite channel for flutter sqflite plugin[desktop only]☆16Jul 16, 2020Updated 5 years ago
- Directory theme for caddy caddyfile☆17Jan 23, 2017Updated 9 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- ☆11Jun 15, 2022Updated 3 years ago
- there are UKIJ and Uighursoft fonts☆13Oct 21, 2022Updated 3 years ago
- ☆11Apr 19, 2026Updated last week
- Train neural network via pytorch, and run nn model on ESP32☆11Dec 1, 2022Updated 3 years ago
- Spell correction language model for Uyghur language based on transformer neural network☆15Jun 18, 2025Updated 10 months ago
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Target speaker automatic speech recognition (TS-ASR)☆13Oct 14, 2023Updated 2 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- A fully functional Othello (Reversi) game, with several AIs, made in prolog for swipl.☆17Apr 26, 2024Updated 2 years ago
- Time delay neural network (TDNN) implementation in Pytorch using unfold method☆204Nov 21, 2019Updated 6 years ago
- a standalone pitch extractor☆13Oct 19, 2017Updated 8 years ago
- UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language☆13Jan 6, 2026Updated 3 months ago
- dashboard module for emoncms☆14Oct 23, 2025Updated 6 months ago
- Make N-Gram for Uyghur language☆15Dec 24, 2020Updated 5 years ago
- A support tool for creating labels for NNSVS training data.☆10Apr 5, 2023Updated 3 years ago
- OBD Scan Tool .NET 2.0☆13Oct 5, 2015Updated 10 years ago
- Music Catalogizer + MP3 ID tag parser + Radio (WPF, WebApi, Angular)☆14Oct 20, 2021Updated 4 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- It is a WIP C# .Net Framework implementation of the original markitdown Python library.☆25Mar 12, 2025Updated last year
- A query by humming system based on locality sensitive hashing indexes☆12May 8, 2014Updated 11 years ago
- Framework for Deep Speech Recognition☆11Nov 22, 2022Updated 3 years ago
- This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron a…☆12Dec 28, 2018Updated 7 years ago
- Master thesis of Ondrej Platek: Automatic speech recognition using Kaldi. Supervised by Filip Jurcicek.☆15Feb 20, 2020Updated 6 years ago
- ☆13Oct 12, 2018Updated 7 years ago