zilliz-bootcamp / audio_searchLinks
This project use PANNs for audio tagging and sound event detection, and finally get audio embeddings. Then Milvus is used to search the similarity audio items.
☆26Updated 3 years ago
Alternatives and similar repositories for audio_search
Users that are interested in audio_search are comparing it to the libraries listed below
Sorting:
- Clone a voice in 5 seconds to generate arbitrary speech in real-time☆34Updated 5 years ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆102Updated 2 years ago
- 语音识别模型pytorch转ONNX转MNN,C++实现部署☆68Updated 2 years ago
- paraformer(chinense asr) online onnx runtime for python☆46Updated last year
- Whisper in TensorRT-LLM☆16Updated last year
- ncnn HiFi-GAN☆26Updated 8 months ago
- ☆29Updated 5 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- Python的音频工具☆14Updated 7 months ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- Spliting the ASR probability distribution results into the chinese pinyin, so as to extract more effective feature for the chinese speech…☆21Updated 2 years ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- (已过时)WaveNet 声码器☆21Updated 5 years ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆72Updated 2 months ago
- ☆9Updated 5 years ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Updated 3 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, Korean, Chinese, German and Ea…☆14Updated 4 years ago
- ☆75Updated 3 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- 单独维护的中文TTS☆35Updated 2 years ago
- some ncnn demos of FunASR☆25Updated 9 months ago
- mnn asr demo.☆20Updated 3 months ago
- qwen2 and llama3 cpp implementation☆44Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Updated last year
- Python Wrapper of Silero VAD☆55Updated last month
- Project of Singing Voice Conversion.☆14Updated last year
- ☆32Updated 3 years ago