☆19Jan 5, 2020Updated 6 years ago
Alternatives and similar repositories for ry-Speech-commands
Users that are interested in ry-Speech-commands are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- Broadcasted Residual Learning for Efficient Keyword Spotting☆23Jul 9, 2021Updated 4 years ago
- Tensorflow implementation of "Small-Footprint Keyword Spotting with Multi-Scale Temporal Convolution"(INTERSPEECH 2020)☆32Nov 11, 2020Updated 5 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices☆233Mar 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- Mining effective negative training samples for keyword spotting (PyTorch)☆64May 23, 2020Updated 5 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Dec 8, 2019Updated 6 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 7 months ago
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆113Sep 14, 2022Updated 3 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in Pytorch.☆29Mar 3, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23May 19, 2021Updated 4 years ago
- ☆17Jun 13, 2022Updated 3 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆139Apr 29, 2022Updated 3 years ago
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆13Feb 22, 2022Updated 4 years ago
- ☆14Jul 17, 2020Updated 5 years ago
- RNN implementation with Tensorflow (LSTM) to classify variable length sound sequences☆23Aug 19, 2022Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆19Nov 13, 2020Updated 5 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Jan 11, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementations of neural network models for keyword spotting☆525May 22, 2023Updated 2 years ago
- ☆60Sep 26, 2020Updated 5 years ago
- An implementation of deep-voice-conversion using pytorch☆19Dec 10, 2021Updated 4 years ago
- ☆24Mar 18, 2024Updated 2 years ago
- A Pytorch based LSTM Punctuation Restoration Implementation/A Simple Tutorial for Leaning Pytorch and NLP☆24Jan 11, 2021Updated 5 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆31Jan 28, 2018Updated 8 years ago
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Jun 29, 2020Updated 5 years ago
- ☆13Jun 24, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pytorch implementation of "Fast Training of Triplet-based Deep Binary Embedding Networks".☆40Dec 14, 2017Updated 8 years ago
- An echo cancellation library for browsers using DTLN-aec☆26Oct 18, 2023Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 7 years ago
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13May 6, 2023Updated 2 years ago
- Pytorch implementation of Backdoor Attack against Speaker Verification☆28Sep 19, 2023Updated 2 years ago
- Seeing Wake Words: Audio-visual Keyword Spotting☆66Sep 16, 2020Updated 5 years ago
- Python version of "Fast Training of Triplet-based Deep Binary Embedding Networks" by Zhuang et al.☆12Sep 8, 2016Updated 9 years ago