resemble-ai / normalise
A module for normalising text.
☆9Updated 4 years ago
Related projects: ⓘ
- Simple text to phonemes converter for multiple languages☆21Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- python wrapper for rnnoise library☆45Updated last year
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 2 years ago
- Python implementation of the "Shazam" algorithm☆47Updated 5 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- Singing voice detection☆15Updated 6 years ago
- Wave-U-Net for automatic (drum) mixing☆38Updated last year
- A repository for dictionaries to be used with the Prosodylab-Aligner☆17Updated 10 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆35Updated last week
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 5 years ago
- ☆23Updated this week
- Detect individual instruments activity in an audio file. 🎤🎹🎸🥁☆14Updated 3 years ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆75Updated 3 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition…☆97Updated 2 years ago
- Forced Alignments for Common Voice☆29Updated 3 years ago
- Mellotron singing synthesizer using CPU☆13Updated last year
- ☆15Updated last year
- ☆31Updated 2 years ago
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.☆41Updated last week
- Pre-trained model and script to automatically align lyrics to polyphonic audio☆101Updated 4 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆41Updated 3 years ago
- A recursive forced aligner built on Gentle.☆16Updated 5 years ago
- Pitch-shifting, time-stretching, and vocoding of speech with Controllable LPCNet (CLPCNet)☆147Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 5 years ago
- Deep Performer: Score-to-audio music performance synthesis☆41Updated last year
- Single Pass Spectrogram Inversion in a Jupyter Python notebook☆33Updated 7 years ago