SpeechYOLO Interspeech 2019
☆46Aug 16, 2022Updated 3 years ago
Alternatives and similar repositories for speech_yolo
Users that are interested in speech_yolo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- Combine YOLOv3 with MiDaS with a single Resnext101 backbone for Autonomous Navigation☆25Jan 17, 2021Updated 5 years ago
- Developing an algorithm using MATLAB to detect the unknown location(coordinates) of a sound source in a closed room using a series of mic…☆14Jan 10, 2018Updated 8 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An online speech recognition extension toolkit of Kaldi☆55Jun 23, 2021Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 4 years ago
- More Than YOLO(v3, v4, v3-tiny, v4-tiny)☆154Feb 14, 2022Updated 4 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- Python C extension for the eSpeak speech synthesizer☆12Jan 23, 2021Updated 5 years ago
- ☆10Mar 21, 2018Updated 8 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Oct 19, 2024Updated last year
- YOLOv4 Pytorch implementation with all freebies and specials and 15+ more exclusive improvements. Easy to use!☆132Aug 3, 2021Updated 4 years ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- Pytorch code for Tracklet Association Unsupervised Deep Learning (TAUDL)☆16Jan 5, 2021Updated 5 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite.☆10Jul 1, 2020Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆31Jul 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Jul 20, 2022Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Camera acquisition☆11Feb 11, 2026Updated 2 months ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [TGRS 2023] An official implementation of Multitype Feature Perception and Refined Network for Spaceborne Infrared Ship Detection☆12May 23, 2024Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Analytic signal spectrograms with optimized time-frequency resolution☆10Oct 6, 2020Updated 5 years ago
- Summary of methods to convert models in Yolo-v4-v3-v2☆74Jun 18, 2020Updated 5 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆43May 9, 2023Updated 2 years ago