ina-foss / inaFaceAnalyzer
INA's library with pretrained models for gender and age prediction from faces.
☆21Updated 5 months ago
Alternatives and similar repositories for inaFaceAnalyzer:
Users who are interested in inaFaceAnalyzer are comparing it to the libraries listed below.
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Updated 4 years ago
- 🐸TTS recipes for different datasets☆86Updated 2 years ago
- Creation of a multi-user, audio-first annotation tool - GSoC 2021☆29Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆31Updated 2 years ago
- REST API for the Mozilla DeepSpeech voice recognition engine☆20Updated 3 years ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- A pipeline to read lips and generate speech for the read content, i.e. Lip to Speech Synthesis.☆83Updated 3 years ago
- An algorithm that analyses the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 6 years ago
- My public domain speech index☆10Updated 5 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 5 years ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 7 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆36Updated last year
- Human age estimation using deep neural networks (Keras)☆12Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 6 years ago
- Learning Lip Sync of Obama from Speech Audio☆67Updated 4 years ago
- Lyrics-to-audio alignment system. Initially done using HTK for rapid prototyping☆14Updated 7 years ago
- Speaker diarization system in Python based on binary key speaker modelling☆61Updated 3 years ago
- Community framework for training tortoise☆41Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 6 months ago
- Mirror of the VoxCeleb dataset - a large-scale speaker identification dataset☆71Updated 5 years ago
- Tool to analyze audio corpora in terms of intonation, intensity, duration and voice quality☆21Updated 5 years ago
- ☆22Updated 3 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Repo for active speaker detection in media videos.☆26Updated last year
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- Zoom Audio Transcription offline☆32Updated 4 years ago
- Prosody Morph: A Python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.☆85Updated last year