In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Datasetโฆ
โ20Jul 18, 2018Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spiking ๐ง and artificial ๐ค RNN solutions to Speech Commands Dataset ๐ฃ๏ธ in TensorFlowโ14Feb 3, 2021Updated 5 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Networkโ10Dec 12, 2018Updated 7 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.โ25Jan 28, 2019Updated 7 years ago
- This was for a DevICT (http://devict.org) presentation given July, 2020.โ13Jul 9, 2020Updated 5 years ago
- The easiest way to start developing with Amazon Mechanical Turk. This will show how to quickly create an ExternalQuestion using Python (Fโฆโ23Dec 26, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform โข AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- โ33Nov 27, 2021Updated 4 years ago
- Automatic Arabic diacritics restoration tool.โ18Aug 12, 2021Updated 4 years ago
- ipcipher is a specification for encrypting IP{v4,v6} addresses 'in place'.โ20Mar 28, 2018Updated 8 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) Dโฆโ25Jan 19, 2018Updated 8 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.โ11Apr 14, 2021Updated 5 years ago
- An experimental method for jumping to lines on screenโ11Jun 20, 2020Updated 5 years ago
- Following research on S4 in jaxโ16Jun 15, 2022Updated 3 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Ratesโ12Mar 13, 2024Updated 2 years ago
- โ11Jun 15, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI โข AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- there are UKIJ and Uighursoft fontsโ13Oct 21, 2022Updated 3 years ago
- Configuration files for my setupโ10Jun 4, 2023Updated 2 years ago
- โ11May 17, 2026Updated last week
- Train neural network via pytorch, and run nn model on ESP32โ11Dec 1, 2022Updated 3 years ago
- Spell correction language model for Uyghur language based on transformer neural networkโ15Jun 18, 2025Updated 11 months ago
- uyghur text resource crawled from websiteโ12Dec 25, 2015Updated 10 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITSโ48Dec 1, 2022Updated 3 years ago
- Target speaker automatic speech recognition (TS-ASR)โ14Oct 14, 2023Updated 2 years ago
- Transformer based ASR Engine.โ13Aug 23, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting โข AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An HTML5 Mumble client for Chrome, Firefox, Edge and now also Safariโ26May 14, 2026Updated last week
- Time delay neural network (TDNN) implementation in Pytorch using unfold methodโ204Nov 21, 2019Updated 6 years ago
- A python documentation linter which checks that the docstring description matches the definition. Based on darglint by @terrencepreilly.โ26Apr 1, 2024Updated 2 years ago
- a standalone pitch extractorโ13Oct 19, 2017Updated 8 years ago
- Volcengine TOS C++ SDKโ11May 8, 2026Updated 2 weeks ago
- dashboard module for emoncmsโ14May 11, 2026Updated 2 weeks ago
- Make N-Gram for Uyghur languageโ15Dec 24, 2020Updated 5 years ago
- NNSVSๅใใฎๆๅธซใใผใฟใฎใฉใใซไฝๆๆฏๆดใใผใซใงใใโ10Apr 5, 2023Updated 3 years ago
- OBD Scan Tool .NET 2.0โ13Oct 5, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI โข AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Music Catalogizer + MP3 ID tag parser + Radio (WPF, WebApi, Angular)โ14Oct 20, 2021Updated 4 years ago
- Speech Recognition for Uyghur using deep learningโ42Oct 21, 2021Updated 4 years ago
- real-time speech enhanceโ17Jan 23, 2024Updated 2 years ago
- A query by humming system based on locality sensitive hashing indexesโ12May 8, 2014Updated 12 years ago
- โ11Nov 13, 2015Updated 10 years ago
- A daemon which watches a queue and runs stable diffusion.โ28Sep 23, 2022Updated 3 years ago
- This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron aโฆโ12Dec 28, 2018Updated 7 years ago