Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
☆43Dec 14, 2022Updated 3 years ago
Alternatives and similar repositories for Speech-Commands-Classification-by-LSTM-PyTorch
Users that are interested in Speech-Commands-Classification-by-LSTM-PyTorch are comparing it to the libraries listed below
Sorting:
- Keyword Spotting for detecting a word in an audio file☆17Jul 21, 2019Updated 6 years ago
- ☆21Mar 8, 2020Updated 5 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Speech Commands Recognition using end-to-end deep learning models in pytorch☆28Oct 8, 2020Updated 5 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Oct 19, 2020Updated 5 years ago
- 利用Torch和强化学习训练flappy bird小游戏☆14Nov 18, 2022Updated 3 years ago
- Predicting Billboard's Year-End Hot 100 Songs using audio features from Spotify and lyrics from Musixmatch☆18Jul 14, 2024Updated last year
- In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not…☆19Sep 27, 2021Updated 4 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 8 years ago
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge☆200Jan 19, 2024Updated 2 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 2 years ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Aug 2, 2024Updated last year
- ☆22Sep 3, 2018Updated 7 years ago
- https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques …☆26May 5, 2017Updated 8 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆29Feb 12, 2018Updated 8 years ago
- A neural attention model for speech command recognition☆186Jul 12, 2025Updated 7 months ago
- Using spectrograms and convolutional neural networks to listen to environment sounds.☆32Jul 23, 2021Updated 4 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- a Federated Learning Framework adapted for resource-constrained environments, focusing on IoT devices☆10Oct 6, 2025Updated 5 months ago
- A lightweight library to read/write wave audio files to/from lists of native Python types.☆12Jun 10, 2024Updated last year
- Implementation of Dynamic Computation Offloading Control Logic in a Software-Defined Vehicle (SDV) System☆11Dec 19, 2024Updated last year
- ☆35Apr 8, 2019Updated 6 years ago
- Speaker Identification using GMM and Speech Recognition using HMMs☆38Apr 7, 2014Updated 11 years ago
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Tool for slot extraction from text☆15Oct 23, 2022Updated 3 years ago
- Latex document template of Final Degree Projects done in ETSISI UPM school.☆10Apr 27, 2025Updated 10 months ago
- Source code for "Congestion-aware Distributed Task Offloading in Wireless Multi-hop Networks Using Graph Neural Networks"☆14Oct 23, 2024Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆12Aug 1, 2025Updated 7 months ago
- ☆10Apr 2, 2024Updated last year
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- An HTTP client for the Rust AWS SDK that runs on Fastly Compute @ Edge☆10Nov 11, 2025Updated 3 months ago
- Teaching the Donkey car to drive a track in the simulator using State Representation Learning and different Reinforcement Learning Algori…☆12Dec 6, 2021Updated 4 years ago