Tools for parsing the audio track in television news programs
☆19Apr 24, 2021Updated 4 years ago
Alternatives and similar repositories for Audio
Users that are interested in Audio are comparing it to the libraries listed below
Sorting:
- ☆10Jan 27, 2017Updated 9 years ago
- ☆18Aug 29, 2020Updated 5 years ago
- This module aims to extract emotions from audio. The input argument is either an uploaded audio/video file to the server or a URL. The o…☆22Apr 3, 2018Updated 7 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Feb 25, 2017Updated 9 years ago
- JSON schema and JavaScript model classes for dealing with time-aligned transcripts of speech.☆16Aug 20, 2018Updated 7 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 10 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- Dialect identification using Siamese network☆15Dec 12, 2017Updated 8 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- PolyglotDB is a package for phonetic corpus storage and analysis☆50Jan 30, 2026Updated 3 weeks ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data.☆53Feb 18, 2026Updated last week
- Toolkit for supporting the EBU-TT Live specification☆27Oct 11, 2023Updated 2 years ago
- Python classes for the Buckeye Corpus☆26Mar 30, 2018Updated 7 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- Detect calls of attention in the surroundings☆52Jun 10, 2013Updated 12 years ago
- ☆32Aug 4, 2021Updated 4 years ago
- Gazetteer of the Ancient Near East Data☆10Aug 1, 2013Updated 12 years ago
- ☆30Nov 9, 2018Updated 7 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- an tutorial implement of voice conversion using pytorch☆34Mar 30, 2018Updated 7 years ago
- Learning embeddings for laughter categorization☆34Nov 3, 2018Updated 7 years ago
- TTML Profiles for Internet Media Subtitles and Captions (IMSC)☆33Feb 16, 2026Updated last week
- Unicode Standard tokenization routines and orthography profile segmentation☆39Feb 20, 2025Updated last year
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- Automatic Dialect Detection Repository☆39Nov 13, 2022Updated 3 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- Django Code for the Webpage☆10Jan 26, 2025Updated last year
- ☆16Updated this week
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Toolkit for developing OData web services. Can be used from Web API, Nancy, or the platform of your choice.☆14Jun 30, 2022Updated 3 years ago
- Introduction to Algorithms, Third Edition.☆10Apr 2, 2017Updated 8 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- Kalenis LIMS backend☆16Feb 14, 2026Updated last week
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week