zilliz-bootcamp / audio_searchLinks
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆26Updated 4 years ago
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
Sorting:
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆74Updated 3 years ago
- Port of Funasr's Paraformer model in C/C++☆35Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆112Updated 2 years ago
- paraformer(chinense asr) online onnx runtime for python☆53Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- ☆76Updated 3 years ago
- Python的音频工具☆16Updated 10 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- ncnn HiFi-GAN☆29Updated 11 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆79Updated 3 weeks ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆135Updated last month
- chinese real time voice cloning☆38Updated 5 years ago
- some ncnn demos of FunASR☆26Updated 11 months ago
- ☆63Updated last year
- A library for adding punctuation into a text from ASR.☆19Updated 2 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆65Updated 3 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆86Updated last year
- ☆33Updated 4 years ago
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Updated 3 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆21Updated 2 years ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆101Updated 11 months ago
- ☆30Updated 6 years ago
- mfcc, mel, pcen. (librosa)☆36Updated 5 years ago
- 用 OCR 提取视频硬字幕☆79Updated 7 months ago
- Python Wrapper of Silero VAD☆59Updated 4 months ago
- Using OpenVINO to speed up MeloTTS inference☆13Updated 10 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- PaddleSpeech TTS cpp☆41Updated 2 years ago