znaoya / aenetView external linksLinks
AENet: audio feature extraction
☆60Aug 30, 2019Updated 6 years ago
Alternatives and similar repositories for aenet
Users that are interested in aenet are comparing it to the libraries listed below
Sorting:
- Code and demos for our paper at ACM MM 2017☆62May 2, 2019Updated 6 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- ☆15Nov 6, 2017Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Extracts the shot classes and generic visual features for a broadcast news video.☆13Jul 23, 2017Updated 8 years ago
- The Video2GIF dataset with 100k GIFs from our paper at CVPR2016☆101Aug 10, 2017Updated 8 years ago
- A dataset with user created GIFs☆49Oct 7, 2018Updated 7 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- Documented code with instructions to reproduce results of paper submitted to ECML☆13Oct 11, 2018Updated 7 years ago
- Phonetic and phonological vocoding platform☆17Nov 23, 2016Updated 9 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit☆16Feb 20, 2016Updated 9 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 7 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- Bayesian spEEch Recognizer☆55Jan 11, 2021Updated 5 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Oct 8, 2018Updated 7 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model☆17Nov 24, 2016Updated 9 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools☆34Aug 20, 2019Updated 6 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- A C++ library for parsing and manipulating JSGF grammar files.☆14Feb 13, 2024Updated 2 years ago