Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.
☆10Dec 5, 2022Updated 3 years ago
Alternatives and similar repositories for Speech-Command-Classification
Users that are interested in Speech-Command-Classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆43Dec 14, 2022Updated 3 years ago
- Latest PyTorch Implementation of DeltaGRU & DeltaLSTM that Exploits Temporal Sparsity in Sequential Data☆18Sep 30, 2023Updated 2 years ago
- Triangle Attack: A Query-efficient Decision-based Adversarial Attack (ECCV 2022)☆16Jul 19, 2022Updated 3 years ago
- ESP32 ESP-IDF Components (I2C, 1-wire, SPI, ADC, etc.) GUVA-S12SD AHTXX AK8975 AS7341 BH1750FVI BME680 BMP280 BMP390 CCS811 ENS160 HDC108…☆43Nov 29, 2025Updated 7 months ago
- Attention-based multimodal fusion for sentiment analysis☆13Aug 14, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Lightweight CNN for Robust Voice Activity Detection☆20Jun 30, 2023Updated 3 years ago
- kaggle情感分析rnn+attention解法☆12Nov 17, 2017Updated 8 years ago
- Multimodal Affective Analysis Using Hierarchical Attention Strategy☆12Dec 7, 2018Updated 7 years ago
- waveform classification with recordings from mouse brain using Neuropixels probes☆21Sep 29, 2022Updated 3 years ago
- Voice Activity Detector based on MFCC features and DNN model☆30Jul 3, 2023Updated 2 years ago
- Implementation of the paper "Emotion Identification from raw speech signals using DNNs"☆14Jun 11, 2020Updated 6 years ago
- An open source toolbox for LMDI decomposition analysis in Python☆16May 10, 2026Updated last month
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 8 years ago
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 2019科大讯飞 阿尔茨海默综合症预测挑战赛baseline☆12Jul 12, 2019Updated 6 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- RISC-V Zve32x, Zve32f, Zvfh Vector Coprocessor☆20Jun 18, 2026Updated last week
- This is my PyTorch implementation of the "Very Deep Convolutional Neural Networks For Raw Waveforms" research paper published in 2016.☆17Aug 24, 2021Updated 4 years ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆30Mar 31, 2022Updated 4 years ago
- Cherokee Audio data☆11Dec 24, 2023Updated 2 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆18Oct 26, 2021Updated 4 years ago
- A framework for writing window managers in familiar HTML, JS, and CSS.☆44Nov 29, 2025Updated 7 months ago
- Radar and communication waveform classification using Deep Learning techniques☆28Aug 3, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 对糖尿病数据集的研究☆22Jul 6, 2017Updated 8 years ago
- ☆17Dec 5, 2019Updated 6 years ago
- Python repository of Grey Models☆18Jul 31, 2024Updated last year
- Voice Activity Detection☆29Nov 13, 2017Updated 8 years ago
- Alzheimer's Dementia Recognition through Spontaneous Speech The ADReSSo Challenge☆14Aug 6, 2023Updated 2 years ago
- A NVIDIA's Pytorch Tacotron2 adaptation with unsupervised Global Style Tokens. The model has been trained with the English read-speech LJ…☆10Sep 4, 2023Updated 2 years ago
- A transformer that decodes swipes across a smartphone keyboard into words (gesture / swipe / glide typing) (enhanced yandex cup solution)☆15Feb 20, 2026Updated 4 months ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Jun 27, 2019Updated 7 years ago
- The Pytorch implementation of paper Multimodal fusion for alzheimer's disease recognition☆17Aug 23, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Machine Learning Approach for the Diagnosis of Parkinson's Disease via Speech Analysis☆21Dec 27, 2020Updated 5 years ago
- Code & Data for the Paper "Time Masking for Temporal Language Models", WSDM 2022☆20Apr 23, 2023Updated 3 years ago
- JUCE audio plugin for realtime pitch shifting and voice duplication from MIDI keyboard input. Works differently than a vocoder as it can …☆12Sep 14, 2021Updated 4 years ago
- The final coursework for AI in Mental Health @ PKU.☆22Jan 5, 2024Updated 2 years ago
- A Pytorch implementation of emotion recognition from videos☆18Sep 15, 2020Updated 5 years ago
- A Beamerposter template with University of Cambridge logo and colors. It is forked from Gemini.☆24Nov 27, 2024Updated last year
- 基于深度学习的普通话语音识别☆18Apr 23, 2019Updated 7 years ago