SpeechYOLO Interspeech 2019
☆46Aug 16, 2022Updated 3 years ago
Alternatives and similar repositories for speech_yolo
Users that are interested in speech_yolo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Mar 26, 2026Updated 2 months ago
- clip retrieval benchmark☆17May 4, 2022Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 4 years ago
- More Than YOLO(v3, v4, v3-tiny, v4-tiny)☆154Feb 14, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".☆12Apr 3, 2019Updated 7 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- Python C extension for the eSpeak speech synthesizer☆12Jan 23, 2021Updated 5 years ago
- Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers (Yoshiki Masuyama et al., 2018)☆14Dec 17, 2018Updated 7 years ago
- ☆10Mar 21, 2018Updated 8 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 10 years ago
- ☆11Oct 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- YOLOv4 Pytorch implementation with all freebies and specials and 15+ more exclusive improvements. Easy to use!☆132Aug 3, 2021Updated 4 years ago
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite.☆10Jul 1, 2020Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- ☆11Jan 20, 2021Updated 5 years ago
- Audio recorder and player based on STM32☆12Jul 21, 2016Updated 9 years ago
- Bob Telegram Tools is a python library that allows you to monitor your machine learning methods just by using Telegram without any additi…☆11Jul 10, 2020Updated 5 years ago
- Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).☆32Jul 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 5 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 3 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Jul 20, 2022Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Analytic signal spectrograms with optimized time-frequency resolution☆10Oct 6, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Feb 1, 2017Updated 9 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆56Jul 6, 2023Updated 2 years ago
- PowerWorkflow is an open-source collection of agentic workflows for power system applications. These workflows enable intelligent automat…☆32Jul 19, 2025Updated 10 months ago
- A Chainer implementation of Fast WaveNet(mel-spectrogram vocoder).☆31Jul 27, 2018Updated 7 years ago