Wav2kws is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Google Speech Commands datasets V1 and V2.
☆13Jun 11, 2021Updated 4 years ago
Alternatives and similar repositories for wav2kws
Users that are interested in wav2kws are comparing it to the libraries listed below.
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- NUS ME5413 Autonomous Mobile Robotics Final Project☆18Apr 6, 2025Updated 11 months ago
- This is a Python project that uses Selenium and OpenAI to scrape data from the web, process it with GPT-3, and generate reports based on …☆12Oct 28, 2025Updated 5 months ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Jan 28, 2019Updated 7 years ago
- Few-shot text classification with meta learning and BERT☆11Jun 14, 2021Updated 4 years ago
- Repository for the code of the simplex non-negative matrix factorization algorithm for EDXS data☆14Feb 6, 2025Updated last year
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- Step-by-step tutorials for learning drone development with PX4, ROS 2, and Gazebo simulation. From basic setup to camera integration and …☆19Mar 26, 2025Updated last year
- Guide on how to use MLDA GPU servers for Machine Learning☆14Oct 5, 2021Updated 4 years ago
- A mobile application that can help users get the perfect blackboard photos.☆25Jun 2, 2024Updated last year
- An implementation of MatchboxNet☆13May 4, 2022Updated 3 years ago
- ☆16Sep 12, 2020Updated 5 years ago
- SLS : Neural Information Retrieval(IR)-based Semantic Search model☆13Mar 21, 2025Updated last year
- NUS Report Template LaTex☆18Oct 19, 2025Updated 5 months ago
- ☆16Jun 12, 2023Updated 2 years ago
- ☆10Apr 23, 2024Updated last year
- Test Framework for few-shot open set KWS☆42Nov 8, 2024Updated last year
- BC-ResNet for Keyword Spotting☆42Jan 11, 2022Updated 4 years ago
- ☆19Nov 4, 2022Updated 3 years ago
- A Python scrapper to access ModDB mods, games and more as objects☆16Mar 22, 2026Updated last week
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769☆139Apr 29, 2022Updated 3 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆107Dec 8, 2022Updated 3 years ago
- Alif Semiconductor Repository forked from the ARM ml-embedded-evaluation-kit from https://git.mlplatform.org/ml/ethos-u/ml-embedded-evalu…☆13Mar 11, 2026Updated 2 weeks ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art results on Speech Commands datasets V1 and V2.☆109Jan 11, 2023Updated 3 years ago
- ☆18Apr 15, 2020Updated 5 years ago
- ☆13Jan 8, 2024Updated 2 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitfli…☆19May 24, 2018Updated 7 years ago
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆283May 23, 2022Updated 3 years ago
- Vietnamese BERT pre-trained model of FPT.AI☆14Oct 16, 2020Updated 5 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- ☆13Jan 10, 2017Updated 9 years ago
- An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset☆12Dec 11, 2019Updated 6 years ago
- This is an on-going project repository☆15Feb 10, 2024Updated 2 years ago
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Apr 4, 2023Updated 2 years ago
- Open source voice bot for Humanoid Robots and virtual digital humans☆17Apr 24, 2022Updated 3 years ago
- Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quali…☆26Jul 4, 2025Updated 8 months ago
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year