Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.
☆13Jun 11, 2021Updated 5 years ago
Alternatives and similar repositories for wav2kws
Users that are interested in wav2kws are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- NUS ME5413 Autonomous Mobile Robotics Final Project☆18Apr 6, 2025Updated last year
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Guide on how to use MLDA GPU servers for Machine Learning☆14Oct 5, 2021Updated 4 years ago
- An implementation of MatchboxNet☆13May 4, 2022Updated 4 years ago
- ☆10Apr 23, 2024Updated 2 years ago
- Test Framework for few-shot open set KWS☆43Nov 8, 2024Updated last year
- ☆19Nov 4, 2022Updated 3 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆140Apr 29, 2022Updated 4 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆110Dec 8, 2022Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆110Jan 11, 2023Updated 3 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…☆19May 24, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆287May 23, 2022Updated 4 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 4 years ago
- Vietnamese BERT pre-trained model of FPT.AI☆14Oct 16, 2020Updated 5 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- This is an on-going project repository☆15Feb 10, 2024Updated 2 years ago
- Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quali…☆27Jul 4, 2025Updated 11 months ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Region proposal network based small-footprint keyword spotting (Pytorch)☆57Nov 15, 2023Updated 2 years ago
- ☆10May 15, 2021Updated 5 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.☆15Aug 29, 2021Updated 4 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆32Mar 6, 2025Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆10Dec 15, 2022Updated 3 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- Dataset release for Emotional TTS in Indian Accent☆41Mar 25, 2026Updated 2 months ago
- J-Net is aimed for audio separation with randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Sep 19, 2018Updated 7 years ago
- Simple sinc interpolation in PyTorch.☆15Jul 8, 2023Updated 2 years ago
- Rainbow Keywords - Official PyTorch Implementation☆14Jun 27, 2024Updated last year
- Paper: https://arxiv.org/abs/1702.02285☆64Dec 19, 2018Updated 7 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- Time-domain Audio Separation Network (IN PYTORCH)☆23Jan 28, 2019Updated 7 years ago
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago