h7shin / audiosearchengine
Python Audio Search Engine: search for audio .wav files based on percent similarity
☆14Updated 10 years ago
Related projects ⓘ
Alternatives and complementary repositories for audiosearchengine
- 根据MFCC提取音频特征,训练“飞鱼秀”音频节目语音和音乐的切割。☆30Updated 6 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 7 years ago
- Transformer based ASR Engine.☆12Updated 3 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- A python wrapper for kaldi-online-decoder using Cython☆12Updated 7 years ago
- Podcast Summarizer with LLM Technology☆17Updated last year
- noise reduction☆17Updated 4 months ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Updated 6 years ago
- python wrap for hts engine☆14Updated 6 years ago
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref…☆4Updated 2 months ago
- 基于DNN神经网络的简单语音唤醒☆11Updated 5 years ago
- Spleeter implementation in pytorch☆38Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆43Updated 2 months ago
- wake word spotting with kaldi☆19Updated 3 years ago
- 语音唤醒☆9Updated 5 years ago
- ☆10Updated 3 months ago
- video cut powered by AI☆25Updated last year
- Using Kaldi (Automatic Speech Recognition) and Gentle (Forced Word Aligner), this script finds both rhymes and alliteration in speeches w…☆13Updated 6 years ago
- An extensible speech synthesis system, build with PyTorch and the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Updated 5 years ago
- Python implementation of the "Shazam" algorithm☆48Updated 5 years ago
- A minimum inference engine for DiffSinger☆34Updated 7 months ago
- STT Service based on Kaldi ASR☆15Updated 6 years ago
- ☆33Updated 2 years ago
- This repository contains the migrated code of Spleeter from Deezer in TF2.0☆27Updated 3 years ago
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆11Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆13Updated this week
- ☆22Updated 3 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆15Updated 5 years ago
- Methods to compute various chroma audio features and audio similarity measures particularly for the task of cover song identification☆24Updated 4 years ago