zilliz-bootcamp / audio_search
This project uses PANNs for audio tagging and sound event detection to produce audio embeddings. Milvus is then used to search for similar audio items.
☆22Updated 3 years ago
Alternatives and similar repositories for audio_search:
Users who are interested in audio_search are comparing it to the libraries listed below:
- ncnn HiFi-GAN☆26Updated 4 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆31Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆71Updated last year
- Speech recognition model converted from PyTorch to ONNX to MNN, with C++ deployment☆52Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆27Updated 7 months ago
- ☆9Updated 4 years ago
- some ncnn demos of FunASR☆22Updated 4 months ago
- paraformer (Chinese ASR) online onnx runtime for python☆40Updated 10 months ago
- Whisper in TensorRT-LLM☆15Updated last year
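- (Outdated) WaveNet vocoder☆21Updated 4 years ago
- Real-time video frame interpolation deployed with onnxruntime, with both C++ and Python versions of the program☆25Updated 11 months ago
- Deploys PP-MattingV2, the real-time portrait matting model released by Baidu's PaddleSeg, with ONNXRuntime; includes 18 onnx models in total, again with both C++ and Python versions of the program☆30Updated last year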
- A library for adding punctuation into a text from ASR.☆16Updated last year
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆63Updated 2 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 4 years ago
- A naive example of LivePortrait inference with ncnn☆40Updated 5 months ago
- ☆29Updated 5 years ago
- ☆31Updated 3 years ago
- mnn asr demo.☆10Updated last month
- ☆74Updated 2 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆57Updated last month
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆12Updated 4 months ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆11Updated 2 years ago
- End-to-end voice wake-up toolkit, from model training to model inference.☆96Updated 4 months ago
- ncnn Android demo of PP-TinyPose☆25Updated 3 years ago
- C/C++ implementation of the melspectrogram computation from the Python audio processing library librosa☆29Updated 3 years ago
- A curated list of awesome voice activity detection☆29Updated 2 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆35Updated last week