JohannesBuchner/spoken-command-recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JohannesBuchner/spoken-command-recognition)

JohannesBuchner / spoken-command-recognition

A large, free audio sample database (10M words pronounced), a test bed for voice activity detection algorithms and for single-syllable word recognition

☆71

Alternatives and similar repositories for spoken-command-recognition

Users that are interested in spoken-command-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
type-a / speechnet
View on GitHub
Automatic Speech Recognition
☆20Aug 24, 2018Updated 7 years ago
rampa069 / PhnRec
View on GitHub
Phoneme recognizer based on long temporal context (with ALIZE VAD command added)
☆17Apr 7, 2012Updated 14 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
jchristman / adventofcode
View on GitHub
Advent Of Code - Python One-Liner Challenge
☆12Dec 13, 2015Updated 10 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
morelen17 / tts-papers
View on GitHub
List of papers about TTS / Список статей о TTS
☆10Dec 16, 2017Updated 8 years ago
Mmiglio / SpeechRecognition
View on GitHub
Small-footprint Keyword Spotting
☆18Jul 28, 2019Updated 7 years ago
ricardokleinklein / deepMultiSpeech
View on GitHub
Deep Multi-Speech model
☆11Jul 25, 2018Updated 8 years ago
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
sonyc-project / urban-sound-tagging-baseline
View on GitHub
☆15Mar 24, 2023Updated 3 years ago
cjbayron / artist2lyrics
View on GitHub
Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool
☆14Nov 4, 2018Updated 7 years ago
usc-sail / barista
View on GitHub
Barista is an open-source framework for concurrent speech processing.
☆36Mar 19, 2014Updated 12 years ago
zhangxiangnick / wordvec-aligned-en-zh
View on GitHub
Aligned bilingual word vectors for English and Chinese
☆11Jun 25, 2018Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
netankit / AudioMLProject1
View on GitHub
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18May 3, 2015Updated 11 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
adamsolomou / Speech-Enhancement
View on GitHub
Real-time speech enhancement based on spectral subtraction
☆16Feb 18, 2018Updated 8 years ago
blmorris / uPy_AudioCodec
View on GitHub
Eagle design files for an I2S Audio codec daughterboard for the MicroPython pyboard
☆11Dec 2, 2015Updated 10 years ago
lyapple2008 / audioSignalProcess
View on GitHub
Some DSP algorithm implementation
☆18Sep 26, 2018Updated 7 years ago
vivianngo97 / Punctuation_Transcription
View on GitHub
A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.
☆15Aug 6, 2020Updated 5 years ago
aasish / userIntentDataset
View on GitHub
☆14Dec 27, 2016Updated 9 years ago
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
jim-schwoebel / audioset_models
View on GitHub
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).
☆31Jun 17, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lucko515 / Speech-commands-recognition
View on GitHub
Recognizing common speech commands using Keras and Tensorflow.
☆10Dec 17, 2018Updated 7 years ago
taylorlu / AudioKWS
View on GitHub
Audio Keyword Search
☆12May 5, 2019Updated 7 years ago
Giruvegan / stoneskipping
View on GitHub
StoneSkipping model for detecting Chinese camouflaged spam
☆20May 8, 2020Updated 6 years ago
douglas125 / SpeechCmdRecognition
View on GitHub
A neural attention model for speech command recognition
☆187Jul 12, 2025Updated last year
hcmlab / emovoice
View on GitHub
Build your own Real-time Speech Emotion Recognizer
☆119Feb 1, 2019Updated 7 years ago
RLuke22 / curriculum-learning-acr
View on GitHub
ISMIR 2021: Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition
☆10Nov 8, 2021Updated 4 years ago
patrickltobing / shallow-wavenet
View on GitHub
☆18Feb 9, 2020Updated 6 years ago
hyli666 / DNN-SpeechEnhancement
View on GitHub
☆55Jul 21, 2019Updated 7 years ago
lifelongeek / AAS_enhancement
View on GitHub
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28Oct 10, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fosfrancesco / pkspell
View on GitHub
Predict the correct pitch spelling and key signatures given a sequence of midi notes by using a deep-learning approach.
☆18Jul 26, 2022Updated 4 years ago
nycsv / Voice_Activity_Detector
View on GitHub
A statistical model-based Voice Activity Detection
☆196Nov 30, 2018Updated 7 years ago
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
View on GitHub
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64Jan 8, 2021Updated 5 years ago
3140102441 / speak-recognization
View on GitHub
matlab 说话人语音识别
☆22Jul 22, 2017Updated 9 years ago
karamarieliu / gst_tacotron2_wavenet
View on GitHub
☆13Aug 11, 2018Updated 7 years ago
iiitv / lyrics-crawler
View on GitHub
Simple crawler to collect lyrics, written in Python
☆10Sep 8, 2018Updated 7 years ago