SpeechYOLO Interspeech 2019
☆46Aug 16, 2022Updated 3 years ago
Alternatives and similar repositories for speech_yolo
Users that are interested in speech_yolo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆10Nov 5, 2020Updated 5 years ago
- Train a LSTM neural networks on Vox Forge public audio data set to recognize speaker's gender☆13Oct 27, 2017Updated 8 years ago
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- An online speech recognition extension toolkit of Kaldi☆56Jun 23, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 3 years ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Nov 14, 2020Updated 5 years ago
- Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification p…☆16Jun 28, 2015Updated 10 years ago
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".☆12Apr 3, 2019Updated 6 years ago
- Python C extension for the eSpeak speech synthesizer☆12Jan 23, 2021Updated 5 years ago
- Griffin-Lim Like Phase Recovery via Alternating Direction Method of Multipliers (Yoshiki Masuyama et al., 2018)☆14Dec 17, 2018Updated 7 years ago
- ☆10Mar 21, 2018Updated 8 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Oct 19, 2024Updated last year
- Natural language dataset for training a Conversational Recommender System☆11Jul 9, 2019Updated 6 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆46Oct 3, 2023Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Jul 20, 2022Updated 3 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Analytic signal spectrograms with optimized time-frequency resolution☆10Oct 6, 2020Updated 5 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Speech Parameter Estimation Using Differentiable Speech Synthesizer☆44May 9, 2023Updated 2 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆50Feb 1, 2017Updated 9 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆16Jun 13, 2022Updated 3 years ago
- Official Implementation of Mockingjay in Pytorch☆56Jul 6, 2023Updated 2 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Sep 18, 2017Updated 8 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 7 years ago
- Code for paper NeuroGen: activation optimized image synthesis for discovery neuroscience.☆11Sep 24, 2023Updated 2 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- Recognizing a speaker using Deep Learning☆11Dec 25, 2017Updated 8 years ago