Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.
☆15 · Nov 5, 2022 · Updated 3 years ago
Alternatives and similar repositories for Keyword-MLP
Users that are interested in Keyword-MLP are comparing it to the libraries listed below.
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021. ☆40 · Oct 11, 2022 · Updated 3 years ago
- Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769 ☆139 · Apr 29, 2022 · Updated 3 years ago
- Evaluation kit for the HEAR Benchmark ☆63 · Feb 12, 2026 · Updated last month
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw… ☆31 · Mar 6, 2025 · Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 · Jun 16, 2022 · Updated 3 years ago
- Source code for the paper published in IJCNN 2020 "The Impact of Audio Input Representations on Neural Network based Music Transcription" ☆13 · Apr 9, 2020 · Updated 5 years ago
- Continual Learning Benchmark for Spoken Keyword Spotting ☆17 · Jun 7, 2022 · Updated 3 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting" ☆23 · May 19, 2021 · Updated 4 years ago
- ☆16 · May 8, 2022 · Updated 3 years ago
- Implementation and Deployment of Multilingual Custom Keyword Spotting Running in Real-time on an Edge Device. ☆11 · Apr 27, 2023 · Updated 2 years ago
- Feedforward Sequential Memory Networks ☆16 · Aug 2, 2022 · Updated 3 years ago
- This project relates to my B.Tech final-year project, which investigates the influence of linguistic and sentiment analysis features… ☆14 · Oct 22, 2023 · Updated 2 years ago
- Multi-Head-Attention RNN PyTorch implementation for keyword spotting ☆19 · Nov 13, 2020 · Updated 5 years ago
- Given the front and back of an ID card, recognize all text information on the card. ☆10 · Sep 4, 2019 · Updated 6 years ago
- Public repository for "DCT-Former: Efficient Self-Attention with Discrete Cosine Transform" ☆18 · Mar 15, 2023 · Updated 3 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens. ☆13 · Sep 13, 2024 · Updated last year
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem". ☆12 · Oct 25, 2021 · Updated 4 years ago
- ☆88 · May 27, 2023 · Updated 2 years ago
- ☆11 · May 31, 2020 · Updated 5 years ago
- RBM + BP neural network for recognizing handwritten digits and English characters ☆11 · Mar 25, 2023 · Updated 3 years ago
- End-to-end Chinese scene text recognition. ☆12 · Jun 27, 2022 · Updated 3 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using… ☆14 · Jul 2, 2020 · Updated 5 years ago
- ☆13 · Dec 23, 2021 · Updated 4 years ago
- Web app created to collect audio for a course project ☆10 · Apr 6, 2018 · Updated 7 years ago
- A spoken version of the textual story cloze benchmark ☆20 · Aug 6, 2023 · Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆20 · Dec 16, 2020 · Updated 5 years ago
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab… ☆12 · Aug 21, 2022 · Updated 3 years ago
- text to speech ☆10 · Mar 19, 2024 · Updated 2 years ago
- A jekyll template derived from Minimal Mistakes and inspired by academicpages. To see an example of what a webpage might look like with t… ☆15 · May 22, 2018 · Updated 7 years ago
- Multimodal Speech Recognition for phoneme level prediction using Audio-Visual data from TCDTIMIT dataset implementing RNNs with LSTMs for… ☆15 · Jul 27, 2023 · Updated 2 years ago
- speech recognition based on deep neural network/hidden markov model ☆10 · Jun 3, 2020 · Updated 5 years ago
- A BP neural network written by hand in NumPy, comparing the effects of training techniques such as Dropout and Batch Normalization. ☆11 · Dec 19, 2019 · Updated 6 years ago
- This repository contains supplementary material for the paper: "Audio Source Separation Using Variational Autoencoders and Weak Class Sup… ☆11 · Jan 10, 2023 · Updated 3 years ago
- speech recognition of digits based on single Gaussian, Gaussian Mixture, and Hidden Markov Models ☆11 · Jun 3, 2020 · Updated 5 years ago
- Repository for "Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages" ☆15 · Oct 4, 2024 · Updated last year
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on… ☆12 · Oct 23, 2019 · Updated 6 years ago
- ☆21 · Mar 1, 2023 · Updated 3 years ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR. ☆15 · Mar 21, 2023 · Updated 3 years ago
- The successful and effective management of a busy and complex warehouse relies upon the control and location of stock within the warehous… ☆15 · Mar 17, 2021 · Updated 5 years ago