☆10Sep 19, 2018Updated 7 years ago
Alternatives and similar repositories for end-point-detection
Users that are interested in end-point-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository☆26Jul 26, 2020Updated 5 years ago
- "Recurrent Models of Visual Attention" in TensorFlow☆41Apr 13, 2017Updated 8 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)☆17Aug 31, 2017Updated 8 years ago
- a optional way to extract audio feature☆13Jun 10, 2017Updated 8 years ago
- Easier analysis of large speech corpora☆23Jun 22, 2021Updated 4 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Deep neural network based speech enhancement toolkit☆220Jun 14, 2019Updated 6 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆13Mar 25, 2021Updated 5 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 5 years ago
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆18Apr 30, 2022Updated 3 years ago
- Mel-Generalized Cepstrum analysis☆20Jul 21, 2017Updated 8 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition☆15Apr 1, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Script to simulate room impulse responses☆15Sep 29, 2016Updated 9 years ago
- Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)☆16Dec 18, 2023Updated 2 years ago
- ☆20Feb 8, 2026Updated last month
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Sep 3, 2018Updated 7 years ago
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆24Sep 1, 2023Updated 2 years ago
- A Python package with command-line utilities and scripts to aid the development of machine learning models for Silicon Lab's embedded pl…☆63Aug 20, 2025Updated 7 months ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆869Jun 9, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- ☆19Feb 17, 2023Updated 3 years ago
- This repository contains code for a tutorial on end to end automatic speech recognition.☆18Sep 10, 2019Updated 6 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- Implementation and Benchmark Splits to study Out-of-Distribution Generalization in Deep Metric Learning.☆25Oct 2, 2021Updated 4 years ago
- ☆10May 15, 2021Updated 4 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago