winston-wen / Shazam
A naive implementation of Shazam algorithm in Java
☆51Updated 8 years ago
Alternatives and similar repositories for Shazam:
Users that are interested in Shazam are comparing it to the libraries listed below
- 根据MFCC提取音频特征,训练“飞鱼秀”音频节目语音和音乐的切割。☆30Updated 7 years ago
- Voice Print Recognition☆80Updated 10 years ago
- speech recognition based on tensorflow 1.0.0☆140Updated 7 years ago
- This repo augments the scripts in CVTE model (http://kaldi-asr.org/models/m2)☆15Updated 5 years ago
- Chinese Speech To Text Using Wavenet☆161Updated last year
- Music Identification Program based on Shazam's methods☆110Updated 11 years ago
- A simple TTS(text-to-speech) engine for Chinese mandarin☆20Updated 12 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- Kaldi Snapshot☆30Updated 11 years ago
- Separation of singing voice and accompaniment☆77Updated 8 years ago
- CMU Sphinx - Speech Recognition Toolkit☆121Updated 14 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 7 years ago
- torch7 module to convert one person's voice to another's.☆16Updated 9 years ago
- Keyword spotting by Kaldi library☆26Updated 8 years ago
- Explore Text-To-Speech☆26Updated 6 years ago
- 基于MFCC语音特征提取和识别☆74Updated 9 years ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- Face detection and recognition demo with OpenCV☆9Updated 9 years ago
- The offline part of icytranslate(a english-chinese translate platform) ,the output of this project should be a translate model☆19Updated 7 years ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆84Updated 4 years ago
- 这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成☆53Updated 6 years ago
- Voice conversion tools for STRAIGHT☆29Updated 4 years ago
- Efficient voice activity detection algorithms using long-term speech information in C++☆92Updated 5 years ago
- A modified version of Speech Signal Processing Toolkit (SPTK)☆89Updated 2 years ago
- This is MFCC c++ code☆28Updated 4 years ago
- A Demo of Mandarin/Chinese TTS frontend☆279Updated 2 years ago
- 从webrtc抽离出来的vad源代码,供语音分析/检测使用☆30Updated 7 years ago
- 基于双门限识别的语音端点检测系统☆24Updated 7 years ago
- speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…☆396Updated 4 years ago