Util code, issues, discussions
☆29Aug 31, 2018Updated 7 years ago
Alternatives and similar repositories for MIREX-2018-Automatic-Lyrics-to-Audio-Alignment
Users that are interested in MIREX-2018-Automatic-Lyrics-to-Audio-Alignment are comparing it to the libraries listed below
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 7 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Code for ICASSP 2019 paper☆18Oct 29, 2018Updated 7 years ago
- Database of annotated field recording samples that can be used for training audio labelling algorithms☆10Feb 1, 2019Updated 7 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆31May 14, 2024Updated last year
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- An extension of the Kaldi speech recognition software that allows decoding speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Wave-U-Net for automatic (drum) mixing☆38Mar 24, 2023Updated 2 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- Script for converting Kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁☆16Jun 29, 2021Updated 4 years ago
- ☆17Oct 16, 2018Updated 7 years ago
- ☆42Oct 30, 2018Updated 7 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆48Jun 3, 2020Updated 5 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- Filter bank implementation as a convolutional neural network using Python Keras☆17Dec 18, 2024Updated last year
- ☆15May 8, 2021Updated 4 years ago
- Music structure segmentation with convnets☆13Mar 11, 2016Updated 9 years ago
- ☆33Jun 29, 2023Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.☆16Jun 17, 2022Updated 3 years ago
- Lyrics-to-audio alignment system. Based on machine learning algorithms: hidden Markov models with Viterbi forced alignment. The alignme…☆59Mar 9, 2020Updated 6 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- Wake word spotting with Kaldi☆19Dec 3, 2020Updated 5 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- FVN is now obsolete. Please use CAPRICEP instead. I will stop updating this tool. Frequency domain variants of Velvet Noise, a flexible b…☆38Aug 12, 2020Updated 5 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago