Pytorch implementation of "spectro-temporal attention-based voice activity detection"
☆13Jun 4, 2024Updated last year
Alternatives and similar repositories for STAM-pytorch
Users that are interested in STAM-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆27Mar 20, 2021Updated 5 years ago
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆17Aug 1, 2024Updated last year
- A LSTM for voice activity detection. In fact, this is a homework which I didn't expected.☆13Dec 3, 2020Updated 5 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- Lightweight CNN for Robust Voice Activity Detection☆20Jun 30, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Repo for our pooling approach on the DCASE2018 task4☆16Jul 6, 2023Updated 2 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆52Apr 7, 2019Updated 7 years ago
- RNN implementation with Tensorflow (LSTM) to classify variable length sound sequences☆23Aug 19, 2022Updated 3 years ago
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆23Nov 25, 2024Updated last year
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆140Jan 20, 2024Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- Voice Activity Detection☆29Nov 13, 2017Updated 8 years ago
- 3D Sound Source Localization using Masked Autoencoders☆19Feb 12, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)☆42Jul 10, 2024Updated last year
- 在Android上运行人脸表情识别的tflite模型☆12Apr 7, 2021Updated 5 years ago
- Computer programming - ShanghaiTech☆12Jan 10, 2020Updated 6 years ago
- ☆10Jan 26, 2021Updated 5 years ago
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆23Apr 2, 2025Updated last year
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆26Aug 21, 2024Updated last year
- shortwave reception software☆14Jul 17, 2018Updated 7 years ago
- PCAP 从入门到成神☆13Sep 26, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for calculate DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- A practical way of learning Swizzle☆38Feb 3, 2025Updated last year
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- Code for "Speaker Clustering using Dominant Sets", ICPR 2018☆11Nov 28, 2020Updated 5 years ago
- Script to generate VAD dataset used in Asteroid recipe☆21Sep 30, 2021Updated 4 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- ☆10Mar 21, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Feedforward Sequential Memory Networks☆17Aug 2, 2022Updated 3 years ago
- 基于pytorch的CRNN☆16Feb 28, 2019Updated 7 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 4 years ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆31Dec 19, 2019Updated 6 years ago
- Diffusion Net TensorFlow implementation☆10Nov 10, 2017Updated 8 years ago
- Library for diffusion maps☆50Dec 22, 2021Updated 4 years ago
- ☆16Mar 29, 2022Updated 4 years ago