zilliz-bootcamp / audio_searchLinks
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆28Updated 4 years ago
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
Sorting:
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆82Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆120Updated 2 years ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- paraformer(chinense asr) online onnx runtime for python☆53Updated last year
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- ☆68Updated last year
- Python的音频工具☆16Updated 2 weeks ago
- ☆33Updated 4 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 5 years ago
- A library for adding punctuation into a text from ASR.☆19Updated 2 years ago
- 基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。☆175Updated last year
- chinese real time voice cloning☆38Updated 6 years ago
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆302Updated this week
- A Tiny Project For ASR model training and Deployment☆27Updated 3 years ago
- 超快的中文普通话TTS☆122Updated 4 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆144Updated 4 months ago
- Python Wrapper of Silero VAD☆62Updated 7 months ago
- ☆75Updated 3 years ago
- ☆31Updated 6 years ago
- (pytorch) multi speaker TTS,☆67Updated 6 years ago
- ☆40Updated 4 years ago
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Updated 3 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆91Updated 3 weeks ago
- A demo of zh/Chinese Text to Speech system run on CPU in real time. 中文实时语音合成系统Demo☆181Updated 3 years ago
- 这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成☆53Updated 7 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆65Updated 3 years ago
- mfcc, mel, pcen. (librosa)☆36Updated 6 years ago
- (已过时)WaveNet 声码器☆21Updated 5 years ago
- Transferability of cross-lingual and cross-age speech emotion recognition☆20Updated 2 years ago