zilliz-bootcamp / audio_search
This project uses PANNs for audio tagging and sound event detection to obtain audio embeddings, then uses Milvus to search for similar audio items.
☆26Updated 3 years ago
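The embed-then-search idea in the description can be illustrated with a minimal sketch. It does not use PANNs or Milvus: the random vectors stand in for PANNs embeddings, the brute-force cosine ranking stands in for a Milvus similarity query, and the names `cosine_similarity`, `top_k_similar`, and `DIM` are hypothetical, chosen only for this illustration.

```python
import math
import random


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector has no direction; treat as dissimilar
    return dot / (norm_a * norm_b)


def top_k_similar(query, library, k=3):
    """Return the k (index, score) pairs with the highest similarity to query."""
    scored = [(i, cosine_similarity(query, vec)) for i, vec in enumerate(library)]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Assumption: random vectors stand in for audio embeddings from a model.
random.seed(0)
DIM = 128  # hypothetical embedding size, chosen only for the demo
library = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(50)]

# A query that is a slightly noisy copy of item 7 should rank item 7 first.
query = [x + random.gauss(0, 0.01) for x in library[7]]
results = top_k_similar(query, library, k=3)
print(results)
```

A vector database replaces the linear scan in `top_k_similar` with an index, which is what makes the search practical once the library grows beyond a few thousand items.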
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
- Speech recognition model converted from PyTorch to ONNX to MNN, with deployment implemented in C++☆71Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆105Updated 2 years ago
- Paraformer (Chinese ASR) online ONNX Runtime for Python☆48Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆69Updated 2 weeks ago
- C/C++ implementation of the melspectrogram computation from the Python audio processing library librosa☆31Updated 3 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- End-to-end voice wake-up toolkit, covering model training through model inference.☆121Updated 10 months ago
- A tiny project for ASR model training and deployment☆27Updated 2 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR).☆74Updated 3 months ago
- ☆9Updated 5 years ago
- (Deprecated) WaveNet vocoder☆21Updated 5 years ago
- Chinese Inverse Text Normalization (Chinese ITN), which converts Chinese numerals in text into Arabic numerals.☆15Updated last year
- ☆75Updated 3 years ago
- Audio tools for Python☆15Updated 8 months ago
- ncnn HiFi-GAN☆26Updated 9 months ago
- (PyTorch) multi-speaker TTS☆68Updated 5 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆12Updated 3 years ago
- Chinese real-time voice cloning☆38Updated 5 years ago
- Python Wrapper of Silero VAD☆57Updated 2 months ago
- ☆30Updated 6 years ago
- qwen2 and llama3 cpp implementation☆45Updated last year
- Detects which song in the database each segment belongs to, returning Nil if no match exists in the database.☆21Updated 4 years ago
- Whisper in TensorRT-LLM☆16Updated last year
- Python bindings of speexdsp noise suppression library☆39Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆18Updated 2 years ago
- SenseVoice-python: An enterprise-grade open-source multi-language ASR system from funasr, run with onnxruntime☆96Updated 9 months ago