In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
☆20Jul 18, 2018Updated 7 years ago
Alternatives and similar repositories for learning_invariances_in_speech_recognition
Users that are interested in learning_invariances_in_speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago
- Google Speech Command Dataset Classification Neural Network, CNN, RNN☆26Aug 29, 2017Updated 8 years ago
- The easiest way to start developing with Amazon Mechanical Turk. This will show how to quickly create an ExternalQuestion using Python (F…☆23Dec 26, 2022Updated 3 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ipcipher is a specification for encrypting IP{v4,v6} addresses 'in place'.☆20Mar 28, 2018Updated 8 years ago
- This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…☆17Mar 8, 2021Updated 5 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Jan 19, 2018Updated 8 years ago
- TensorFlowLiteNet allows to use TensorFlowLite from C#.☆11Apr 14, 2021Updated 5 years ago
- An experimental method for jumping to lines on screen☆11Jun 20, 2020Updated 5 years ago
- try implement go version sqlite channel for flutter sqflite plugin[desktop only]☆16Jul 16, 2020Updated 5 years ago
- ☆10Mar 22, 2022Updated 4 years ago
- Directory theme for caddy caddyfile☆17Jan 23, 2017Updated 9 years ago
- Following research on S4 in jax☆16Jun 15, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated 2 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- ☆11Jun 15, 2022Updated 3 years ago
- Configuration files for my setup☆10Jun 4, 2023Updated 2 years ago
- ☆11May 10, 2026Updated last week
- Train neural network via pytorch, and run nn model on ESP32☆11Dec 1, 2022Updated 3 years ago
- Spell correction language model for Uyghur language based on transformer neural network☆15Jun 18, 2025Updated 11 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Dec 1, 2022Updated 3 years ago
- Target speaker automatic speech recognition (TS-ASR)☆14Oct 14, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- A fully functional Othello (Reversi) game, with several AIs, made in prolog for swipl.☆17Apr 26, 2024Updated 2 years ago
- An HTML5 Mumble client for Chrome, Firefox, Edge and now also Safari☆26May 12, 2026Updated last week
- Study materials created by myself for revision for University of Edinburgh's School of Informatics' exams.☆12Apr 25, 2018Updated 8 years ago
- A python documentation linter which checks that the docstring description matches the definition. Based on darglint by @terrencepreilly.☆26Apr 1, 2024Updated 2 years ago
- a standalone pitch extractor☆13Oct 19, 2017Updated 8 years ago
- Volcengine TOS C++ SDK☆11May 8, 2026Updated last week
- UzTransliterator | State-of-the-art machine transliteration tool for Uzbek language☆13Jan 6, 2026Updated 4 months ago
- dashboard module for emoncms☆14May 11, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Make N-Gram for Uyghur language☆15Dec 24, 2020Updated 5 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 3 years ago
- OBD Scan Tool .NET 2.0☆13Oct 5, 2015Updated 10 years ago
- Music Catalogizer + MP3 ID tag parser + Radio (WPF, WebApi, Angular)☆14Oct 20, 2021Updated 4 years ago
- Source code for pandaserd package - create an ERD diagram using pandas dataframes.☆18Apr 9, 2022Updated 4 years ago
- Speech Recognition for Uyghur using deep learning☆42Oct 21, 2021Updated 4 years ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago