taylorlu / MachineLearningDOCLinks
图像、人脸、OCR、语音相关算法整理
☆66Updated 6 years ago
Alternatives and similar repositories for MachineLearningDOC
Users that are interested in MachineLearningDOC are comparing it to the libraries listed below
Sorting:
- C/C++实现Python音频处理库librosa中melspectrogram的计算过程☆31Updated 3 years ago
- 1000点的人脸关键点检测☆154Updated 3 years ago
- Facial landmark detection based on deep convolutional neural network.☆65Updated 7 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆48Updated 7 years ago
- Speaker detection using a lip movement based RNN detector☆76Updated 7 years ago
- Spleeter implementation in pytorch☆26Updated 5 years ago
- Face reconstruction and dense alignment☆197Updated 5 years ago
- Official implementation for paper "A Real-Time and Long-Term Face Tracking Method Using Convolutional Neural Network and Optical Flow for…☆204Updated 3 years ago
- face detect and face alignment C++ project☆14Updated 6 years ago
- PyTorch reimplementation of Tacotron2 in Mandarin☆84Updated 4 years ago
- deep-sdm is appied for face landmark.☆73Updated 5 years ago
- ☆82Updated 2 years ago
- alibaba MNN, mobilenet classifier, centerface detecter, ultraface detecter, pfld landmarker and zqlandmarker, mobilefacenet☆207Updated 3 years ago
- Face detection algorithms in PyTorch.☆80Updated 3 years ago
- light-weight 98 points face landmark超轻98点人脸关键点检测模型☆72Updated 4 years ago
- DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.069…☆118Updated 4 years ago
- 人脸贴纸☆36Updated 5 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 7 years ago
- 制作亚洲人脸数据集☆112Updated 6 years ago
- Deploy SCRFD, an efficient high accuracy face detection approach, in your web browser with ncnn and webassembly☆49Updated 2 years ago
- Regress Face Attributes with MobileNetV2☆43Updated 5 years ago
- Collection of works from VIPL-AVSU☆49Updated 3 months ago
- ☆76Updated 3 years ago
- The style transfer android example☆100Updated last month
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Updated 5 years ago
- Face Tracker using RetinaFace Detector and Kalman Filter☆42Updated 6 years ago
- mfcc, mel, pcen. (librosa)☆36Updated 6 years ago
- A demo of android key word spoting based on tensorflow tutial example☆28Updated 5 years ago
- 106-point face landmarks☆26Updated 6 years ago
- 这个工程的目的是从视频中获取语音识别的训练数据,用于训练字幕自动生成☆53Updated 7 years ago