leonardltk / Shazam-An-Industrial-Strength-Audio-Search-Algorithm-Links
Detecting segments belonging to which song in database, and return Nil if does not exist in a database.
☆22Updated 4 years ago
Alternatives and similar repositories for Shazam-An-Industrial-Strength-Audio-Search-Algorithm-
Users that are interested in Shazam-An-Industrial-Strength-Audio-Search-Algorithm- are comparing it to the libraries listed below
Sorting:
- Zero-Shot Foreign Accent Conversion without a Native Reference☆34Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆12Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- ☆33Updated 3 years ago
- ☆57Updated last year
- speaker-disentangled speech linguistic content quantizer☆22Updated 6 months ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- ☆56Updated 2 years ago
- Project of Singing Voice Conversion.☆15Updated last year
- A simple voice conversion tool☆18Updated 3 years ago
- ☆12Updated 2 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Updated 6 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Singing voice detection☆15Updated 7 years ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated 2 years ago
- Generate accompaniment part with chords using Evolutionary algorithm.☆10Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆34Updated 2 years ago
- Uses machine learning to denoise audio containing speech☆40Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated 3 months ago
- ☆22Updated 11 months ago
- A Neural Audio Codec (NAC) for Universal Audio☆42Updated 3 months ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆26Updated 7 months ago
- ☆38Updated 2 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆51Updated 4 years ago
- My vocoder experiments☆31Updated last month
- Onnx compatible styletts2 code☆13Updated 3 months ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Updated 3 years ago