zilliz-bootcamp / audio_search
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆24Updated 3 years ago
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Paraformer model in C/C++☆31Updated 10 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆65Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Python的音频工具☆14Updated 6 months ago
- paraformer(chinense asr) online onnx runtime for python☆44Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated 2 years ago
- (已过时)WaveNet 声码器☆21Updated 5 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆113Updated 8 months ago
- ☆9Updated 5 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆13Updated 3 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆57Updated last month
- Python Wrapper of Silero VAD☆53Updated last week
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- ☆29Updated 5 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- ☆75Updated 2 years ago
- a naive example of LivePortrait infer by ncnn☆41Updated 9 months ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆25Updated last year
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Updated 3 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆14Updated 4 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆68Updated last month
- ☆52Updated 10 months ago
- mfcc, mel, pcen. (librosa)☆36Updated 5 years ago
- ☆14Updated last year
- A library for adding punctuation into a text from ASR.☆17Updated 2 years ago
- qwen2 and llama3 cpp implementation☆44Updated 11 months ago