SpeechYOLO Interspeech 2019
☆47 · Aug 16, 2022 · Updated 3 years ago
Alternatives and similar repositories for speech_yolo
Users who are interested in speech_yolo are comparing it to the repositories listed below.
- Perform the forced decoding with target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆10 · Nov 5, 2020 · Updated 5 years ago
- Train an LSTM neural network on the VoxForge public audio data set to recognize a speaker's gender ☆13 · Mar 26, 2026 · Updated last month
- clip retrieval benchmark ☆17 · May 4, 2022 · Updated 4 years ago
- Combine YOLOv3 with MiDaS with a single Resnext101 backbone for Autonomous Navigation ☆25 · Jan 17, 2021 · Updated 5 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis" ☆10 · Jan 25, 2021 · Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi ☆55 · Jun 23, 2021 · Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis… ☆15 · Apr 3, 2022 · Updated 4 years ago
- More Than YOLO(v3, v4, v3-tiny, v4-tiny) ☆154 · Feb 14, 2022 · Updated 4 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications. ☆16 · Nov 14, 2020 · Updated 5 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p… ☆16 · Jun 28, 2015 · Updated 10 years ago
- Python C extension for the eSpeak speech synthesizer ☆12 · Jan 23, 2021 · Updated 5 years ago
- ☆10 · Mar 21, 2018 · Updated 8 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d… ☆11 · May 29, 2016 · Updated 9 years ago
- Baseline convolutional ASR system in PyTorch ☆21 · Nov 16, 2023 · Updated 2 years ago
- ☆11 · Oct 19, 2024 · Updated last year
- PyTorch code for Tracklet Association Unsupervised Deep Learning (TAUDL) ☆16 · Jan 5, 2021 · Updated 5 years ago
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping ☆14 · Mar 14, 2018 · Updated 8 years ago
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite. ☆10 · Jul 1, 2020 · Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre… ☆38 · Feb 13, 2020 · Updated 6 years ago
- ☆11 · Feb 11, 2020 · Updated 6 years ago
- A database of clean and noisy speech for audio research ☆10 · Jan 26, 2018 · Updated 8 years ago
- Dr.VOT is a software package for automatic measurement of voice onset time (VOT). ☆32 · Jul 25, 2023 · Updated 2 years ago
- Grapheme to phoneme converter for Estonian ☆14 · May 27, 2021 · Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆71 · Jul 20, 2022 · Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based on the neural network architec… ☆46 · Oct 3, 2023 · Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'. ☆42 · Jul 23, 2023 · Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using PyTorch ☆42 · Feb 8, 2020 · Updated 6 years ago
- Calculates the Word Error Rate between two text files ☆20 · Nov 10, 2022 · Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆26 · Oct 5, 2022 · Updated 3 years ago
- Pepper Robot Enhanced Human Interaction ☆14 · Dec 8, 2022 · Updated 3 years ago
- This repository contains the files used for our Interspeech 2017 paper. ☆16 · May 30, 2017 · Updated 8 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020 ☆43 · Jul 17, 2020 · Updated 5 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer ☆43 · May 9, 2023 · Updated 3 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications. ☆50 · Feb 1, 2017 · Updated 9 years ago
- Quasi-Newton Algorithm for Stochastic Optimization ☆11 · May 20, 2022 · Updated 3 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Official Implementation of Mockingjay in PyTorch ☆56 · Jul 6, 2023 · Updated 2 years ago