Companion toolkit of the 'Serial Speakers' dataset.
☆11Feb 17, 2020Updated 6 years ago
Alternatives and similar repositories for Serial-Speakers
Users that are interested in Serial-Speakers are comparing it to the libraries listed below
Sorting:
- ☆20Nov 3, 2021Updated 4 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- Experiment in automatic insertion of timed transcript corrections☆21Oct 31, 2017Updated 8 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- HFODetector is Python package that that is capable of detecting HFOs with STE / MNI / Hilbert detector. Detection speed is increased by u…☆12Feb 16, 2025Updated last year
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- ☆10Dec 13, 2025Updated 2 months ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- Python toolkit for Visual Speech Recognition☆38Jun 10, 2020Updated 5 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆13Feb 27, 2026Updated last week
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week
- ☆10Jul 24, 2019Updated 6 years ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Mirror of GlottHMM☆10Jun 7, 2016Updated 9 years ago
- dvbshout takes an MPEG transport stream from a DVB card, extracts audio channels from stream, and sends the audio to an Icecast / Shoutca…☆10Jul 29, 2021Updated 4 years ago
- A Pytorch implementation of 'Progressive Neural Networks for Transfer Learning in Emotion Recognition'☆11Jul 31, 2018Updated 7 years ago
- ☆10Feb 19, 2021Updated 5 years ago
- This iPython Notebook is created as a part of the Digital Signal Processing (DSP) class offered at EPFL to explain the process of MP3 enc…☆11Mar 7, 2015Updated 10 years ago
- Arabic - English emotion lexicon☆12Apr 24, 2017Updated 8 years ago
- ☆10Feb 27, 2017Updated 9 years ago
- ☆13Oct 25, 2024Updated last year
- Processing for Hearing-Assistive/Augmented-reality Devices (HADES)☆13Jan 13, 2026Updated last month