This repository shows how to build a speech-to-text model that recognizes short commands. Developing speech recognition and including it in a Python project using Keras is simple.
☆23Jul 21, 2020Updated 5 years ago
Alternatives and similar repositories for speech2text_keras
Users who are interested in speech2text_keras are comparing it to the libraries listed below.
- A Generative Adversarial Network for Shakuhachi Music☆14Jul 2, 2019Updated 6 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Directed Acyclic Tabular GAN (DATGAN) for integrating expert knowledge in synthetic tabular data generation☆18Oct 19, 2024Updated last year
- implement end-to-end asr algorithm with tensorflow☆40Aug 23, 2018Updated 7 years ago
- Build jsPsych Experiments in R☆45Apr 7, 2021Updated 5 years ago
- Speech recognition with CTC in Keras with Tensorflow backend☆31Mar 24, 2023Updated 3 years ago
- Speech recognition framework using keras☆14May 18, 2018Updated 8 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Music generation with GAN!☆19Jun 4, 2017Updated 8 years ago
- ☆14Nov 13, 2017Updated 8 years ago
- ☆19May 9, 2019Updated 7 years ago
- A wrapper for the VexFlow staff engraving library to render staff notation from simple JSON instead of complex API calls.☆45Feb 9, 2016Updated 10 years ago
- Shows how to encrypt data held in public space☆11Aug 11, 2017Updated 8 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- ☆10Aug 3, 2019Updated 6 years ago
- A keras layer implementation of Peddinti's paper "A time delay neural network architecture for efficient modeling of long temporal conte…☆13Nov 19, 2018Updated 7 years ago
- Transform audio files into mel spectrograms for text-to-speech model training☆12Aug 25, 2021Updated 4 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- ☆15Mar 21, 2024Updated 2 years ago
- Scripts to convert audio files to spectrograms and back☆12Nov 23, 2017Updated 8 years ago
- Collection of my implementations of computational models of cognition☆11Nov 20, 2023Updated 2 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23May 19, 2021Updated 5 years ago
- A powerful toolbox for the analysis of HD-EMG recordings☆60May 10, 2026Updated 2 weeks ago
- Audio generator using WaveGAN☆13Feb 9, 2024Updated 2 years ago
- A Docker workflow to work reproducibly with papaja in RStudio☆10Nov 16, 2021Updated 4 years ago
- An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors☆11Apr 8, 2025Updated last year
- Easy-to-use Connectionist Temporal Classification in Keras☆78Aug 17, 2021Updated 4 years ago
- ☆14Aug 25, 2021Updated 4 years ago
- Multi-lingual AudioCaps☆14Nov 20, 2023Updated 2 years ago
- CNN based Minimal model for recognizing word☆61May 7, 2018Updated 8 years ago
- Encode an image to sound (WAV file) and view it as a spectrogram. Optimized Python 3 version.☆11Jan 25, 2023Updated 3 years ago
- Rainbowgram with Python☆13Jan 28, 2019Updated 7 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Prepare spectrograms from audio for training a Riffusion model☆16Mar 6, 2023Updated 3 years ago
- Rich text editor for notebooks☆15Mar 13, 2026Updated 2 months ago
- Dimensionality reduction (UMAP, t-SNE, PCA) for ImageJ/Fiji☆12May 6, 2025Updated last year
- ☆11Nov 29, 2020Updated 5 years ago
- Grad-CAM (Gradient-weighted Class Activation Mapping)☆13Dec 20, 2019Updated 6 years ago
- Convert images to audio for display in a spectrogram☆12Apr 17, 2018Updated 8 years ago