zilliz-bootcamp / audio_search
This project uses PANNs for audio tagging and sound event detection to obtain audio embeddings. Milvus is then used to search for similar audio items.
☆24Updated 3 years ago
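
The pipeline described above (embed each audio clip, then retrieve the nearest embeddings) can be sketched with a brute-force cosine-similarity search. This is only an illustration under stated assumptions: it does not call PANNs or Milvus, the function names `l2_normalize` and `top_k_similar` are hypothetical, and the three-dimensional vectors are made-up stand-ins for real audio embeddings. A vector database such as Milvus would replace the linear scan with an index suited to large collections.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def top_k_similar(query, index, k=2):
    """Return the k (item_id, cosine_similarity) pairs most similar to query.

    `index` maps an item id to its embedding. This is a linear scan,
    used here only as a stand-in for a vector database lookup.
    """
    q = l2_normalize(query)
    scored = []
    for item_id, emb in index.items():
        e = l2_normalize(emb)
        # Dot product of unit vectors equals cosine similarity.
        score = sum(a * b for a, b in zip(q, e))
        scored.append((item_id, score))
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy embeddings (assumed values); real audio embeddings are far larger.
index = {
    "dog_bark.wav": [0.9, 0.1, 0.0],
    "dog_growl.wav": [0.8, 0.3, 0.1],
    "piano.wav": [0.0, 0.2, 0.9],
}

results = top_k_similar([1.0, 0.2, 0.0], index, k=2)
for item_id, score in results:
    print(f"{item_id}: {score:.3f}")
```

Normalizing both vectors first means the ranking depends only on direction, not magnitude, which is a common choice when comparing learned embeddings.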
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
- Port of Funasr's Paraformer model in C/C++☆31Updated 11 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- Speech recognition model converted from PyTorch to ONNX to MNN, with a C++ deployment implementation☆67Updated 2 years ago
- Audio tools for Python☆14Updated 6 months ago
- paraformer (Chinese ASR) online ONNX Runtime for Python☆44Updated last year
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- ☆75Updated 2 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- qwen2 and llama3 cpp implementation☆44Updated last year
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆12Updated 3 years ago
- some ncnn demos of FunASR☆25Updated 8 months ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- ncnn HiFi-GAN☆26Updated 8 months ago
- C/C++ implementation of the melspectrogram computation from the Python audio processing library librosa☆31Updated 3 years ago
- A naive example of LivePortrait inference with ncnn☆41Updated 10 months ago
- Python Wrapper of Silero VAD☆54Updated 3 weeks ago
- Whisper in TensorRT-LLM☆15Updated last year
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- Using OpenVINO to speed up MeloTTS inference☆11Updated 7 months ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- mnn asr demo.☆18Updated 2 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- ☆32Updated 3 years ago
- End-to-end voice wake-up toolkit, from model training to model inference.☆116Updated 9 months ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for the Chinese speech…☆21Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆75Updated 9 months ago
- Chinese real-time voice cloning☆38Updated 5 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆62Updated last month
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Updated 3 years ago