zilliz-bootcamp / audio_search
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆23Updated 3 years ago
Alternatives and similar repositories for audio_search:
Users that are interested in audio_search are comparing it to the libraries listed below
- Port of Funasr's Paraformer model in C/C++☆28Updated 8 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆76Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated last year
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆54Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 4 years ago
- paraformer(chinense asr) online onnx runtime for python☆40Updated 11 months ago
- ncnn HiFi-GAN☆26Updated 5 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- Python bindings of speexdsp noise suppression library☆37Updated 2 years ago
- ☆31Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆16Updated last year
- Whisper in TensorRT-LLM☆15Updated last year
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆29Updated 3 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆59Updated 6 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆42Updated this week
- 单独维护的中文TTS☆35Updated 2 years ago
- Python的音频工具☆12Updated 3 months ago
- ☆74Updated 2 years ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆25Updated last year
- Python Wrapper of Silero VAD☆47Updated 2 months ago
- mnn asr demo.☆13Updated 2 months ago
- one script for xls-r/xlsr/whisper fine-tuning☆40Updated last year
- ☆37Updated 3 years ago
- Utilizes ONNX Runtime for audio denoising.☆33Updated 3 weeks ago
- qwen2 and llama3 cpp implementation☆40Updated 8 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆61Updated 2 months ago
- some ncnn demos of FunASR☆23Updated 5 months ago
- chinese real time voice cloning☆39Updated 5 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆63Updated 3 years ago