arthurfortes/speech2text_keras

This repository shows how to build a speech-to-text model that recognizes short commands. Adding speech recognition to a Python project with Keras is straightforward.

☆23

Alternatives and similar repositories for speech2text_keras

Users who are interested in speech2text_keras are comparing it to the libraries listed below.

basaldella / bioreddit
Word embeddings trained on medical subreddits.
☆10 · Jan 4, 2021 · Updated 5 years ago
FalseNegativeLab / mlscorecheck
Testing the consistency of binary classification performance scores reported in papers
☆12 · Aug 21, 2025 · Updated 10 months ago
omarperacha / GANkyoku
A Generative Adversarial Network for Shakuhachi Music
☆14 · Jul 2, 2019 · Updated 7 years ago
image-js / ocr-tools
Tools for optical character recognition (OCR)
☆10 · Jun 1, 2022 · Updated 4 years ago
lucko515 / Speech-commands-recognition
Recognizing common speech commands using Keras and TensorFlow.
☆10 · Dec 17, 2018 · Updated 7 years ago
glederrey / DATGAN
Directed Acyclic Tabular GAN (DATGAN) for integrating expert knowledge in synthetic tabular data generation
☆18 · Oct 19, 2024 · Updated last year
Gekkio / samsung-photo-frame-ctrl
A small Python application for controlling Samsung photo frames
☆13 · May 7, 2020 · Updated 6 years ago
AdamWilfred / How-to-create-an-interactive-ebook
I'd like to share my experience of using free software to create ebooks, and describe the steps I followed to achieve that. One of the …
☆16 · Feb 9, 2016 · Updated 10 years ago
codyaray / speaker-recognition
Speaker recognition system based upon classification of Mel-Frequency Cepstral Coefficients (MFCC) using a minimum-distance classifier and…
☆20 · Sep 15, 2010 · Updated 15 years ago
aruno14 / speechRecognition
Speech recognition system implemented using TensorFlow
☆16 · Feb 2, 2023 · Updated 3 years ago
yinoue93 / CS224N_proj
A repo for CS224N Final Project
☆15 · Apr 11, 2017 · Updated 9 years ago
markqvist / rnsh
rnsh is a command-line utility written in Python that facilitates shell sessions over Reticulum networks and aims to provide a similar ex…
☆20 · Apr 26, 2026 · Updated 2 months ago
amruthpillai / OCR-Reader
An Android Application that will allow you to identify the text seen from your phone camera, and also be able to speak the text that's id…
☆11 · Oct 26, 2016 · Updated 9 years ago
holm-aune-bachelor2018 / ctc
Speech recognition with CTC in Keras with TensorFlow backend
☆31 · Mar 24, 2023 · Updated 3 years ago
mbenkmann / snb2txt
Convert Samsung S-Note .snb files to markdown text files.
☆19 · Jun 19, 2023 · Updated 3 years ago
bagustris / dimensional-ser
Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning
☆17 · Aug 2, 2024 · Updated last year
TheWall89 / AndrOCR
AndrOCR is an Android application for Optical Text Recognition using Google's Tesseract engine.
☆26 · Jul 31, 2013 · Updated 12 years ago
THEBOSS619 / Note9-Zeus-Q10.0
Note9 Zeus Kernel, the first Android Q kernel that was made on XDA and outside, full EAS without any of Samsung's work. Android-Q branch used from …
☆11 · Jun 11, 2020 · Updated 6 years ago
fernandodelacalle / ResNet-Kaldi-Tensorflow-ASR
Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.
☆21 · Mar 22, 2017 · Updated 9 years ago
sagiebenaim / Singing
☆19 · May 9, 2019 · Updated 7 years ago
nglehuy / ctc_decoders
Baidu's CTC Decoders, including Greedy, Beam Search and Beam Search with KenLM Language Model
☆24 · Oct 28, 2023 · Updated 2 years ago
eigenben / vexflow-json
A wrapper for the VexFlow staff engraving library to render staff notation from simple JSON instead of complex API calls.
☆45 · Feb 9, 2016 · Updated 10 years ago
richarddmorey / encrypt_data_example
Shows how to encrypt data held in public space
☆11 · Aug 11, 2017 · Updated 8 years ago
xavierfav / feature-comparison-clustering
Comparing Audio Features for Unsupervised Sound Classification
☆10 · Jun 22, 2022 · Updated 4 years ago
alexzaitsev / ocr-google-vision
Sample project demonstrating how OCR can be implemented using Google Vision library
☆16 · Dec 22, 2017 · Updated 8 years ago
seanpm2001 / UltraSwitch
UltraSwitch is a service similar to Samsung Smart Switch that allows full backups of data from one device to another, for Android, iOS, L…
☆20 · Jan 1, 2026 · Updated 6 months ago
manymuch / Natural-Noise-Generator
☆10 · Aug 3, 2019 · Updated 6 years ago
findnitai / TDNN-layer
A Keras layer implementation of Peddinti's paper "A time delay neural network architecture for efficient modeling of long temporal conte…
☆13 · Nov 19, 2018 · Updated 7 years ago
rhasspy / wav2mel
Transform audio files into mel spectrograms for text-to-speech model training
☆12 · Aug 25, 2021 · Updated 4 years ago
desiFish / Smart-Aquarium-V2.0
Using NodeMCU ESP8266 for controlling switches (Relay). Uses RTC (Real Time Clock - DS3231) and NTP (Network Time Protocol) for maintaini…
☆12 · Jun 5, 2025 · Updated last year
ReidarRiveland / Instruct-RNN
☆15 · Mar 21, 2024 · Updated 2 years ago
muhdhuz / audio2spec
Scripts to convert audio files to spectrograms and back
☆12 · Nov 23, 2017 · Updated 8 years ago
crsh / cognitive_models
Collection of my implementations of computational models of cognition
☆11 · Nov 20, 2023 · Updated 2 years ago
AdritPal08 / YouTube-Video-to-Notes-Transcription-Application-
An end-to-end project: YouTube Video to Notes Transcription application using Google Gemini.
☆14 · Feb 3, 2024 · Updated 2 years ago
zassou65535 / WaveGAN
An audio generator based on WaveGAN
☆13 · Feb 9, 2024 · Updated 2 years ago
crsh / papaja_docker
A Docker workflow to work reproducibly with papaja in RStudio
☆10 · Nov 16, 2021 · Updated 4 years ago
CoryMcCartan / adjustr
An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors
☆11 · May 29, 2026 · Updated last month
cyprienruffino / CTCModel
Easy-to-use Connectionist Temporal Classification in Keras
☆78 · Aug 17, 2021 · Updated 4 years ago
Hiroshiba / openjtalk-label-getter
☆10 · Dec 10, 2021 · Updated 4 years ago