ina-foss / inaFaceAnalyzerLinks
INA's library with pretrained models for gender and age prediction from faces.
☆24Updated last year
Alternatives and similar repositories for inaFaceAnalyzer
Users that are interested in inaFaceAnalyzer are comparing it to the libraries listed below
Sorting:
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated last month
- Command line utility to manipulate faces in videos and images☆59Updated 5 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆25Updated 11 months ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A PyTorch demo of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.☆37Updated 2 years ago
- Reducing the Noise in the Audio Signal by Deep Learning Methods☆15Updated 3 years ago
- ☆32Updated 3 years ago
- generate granular word-level captions in srt format☆57Updated 3 years ago
- Zoom Audio Transcription offline☆32Updated 5 years ago
- Community framework for training tortoise☆44Updated 3 years ago
- A set of tools to restore audio quality from a variety of old analog sources, such as tape, cassettes, acetates and vinyl.☆112Updated last month
- TTS Client for Coqui TTS server☆13Updated 3 years ago
- Docker images for Coqui AI☆61Updated 4 years ago
- An automatic movie trailer generator.☆42Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆127Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Updated last year
- Tool to extracts the text from a web article urls and get frequency words, entities recognition, automatic summary and more☆20Updated 7 years ago
- ☆14Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- Python package and cli tool to convert wave files (WAV or AIFF) to vector graphics (SVG, PostScript, CVS)☆110Updated 2 weeks ago
- transcribe audio feeds into public web ui☆45Updated 3 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework☆48Updated 2 years ago
- An in-browser app for labeling audio clips at random, using Docker and Flask.☆53Updated 8 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆27Updated 3 years ago
- Python Audio Separator in Real Time using MDX-NET model☆24Updated 2 years ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆16Updated 8 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 4 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 3 years ago