Predict the speaker's gender from an audio file (Flask API included)
☆20May 1, 2023Updated 3 years ago
Alternatives and similar repositories for Gender-Recognition-by-Voice
Users who are interested in Gender-Recognition-by-Voice are comparing it to the libraries listed below.
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆129Apr 25, 2023Updated 3 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- The internet's fastest YouTube downloader made with FFmpeg.WASM.☆13Jul 22, 2023Updated 2 years ago
- A toy-like Text-to-Speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.☆15May 25, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- SinGlow is part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use …☆11Oct 9, 2021Updated 4 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Detecting GAN generated Images using Convolutional Neural Networks☆22Jan 12, 2023Updated 3 years ago
- Code for Discriminative Sounding Objects Localization (NeurIPS 2020)☆60Jan 19, 2022Updated 4 years ago
- ☆16Sep 30, 2023Updated 2 years ago
- ☆25Jun 25, 2021Updated 4 years ago
- Python library for the reverse-engineered Adobe Firefly API☆13Mar 31, 2023Updated 3 years ago
- Code and data release for the paper "Learning from noisy labels by distillation"☆21Nov 17, 2017Updated 8 years ago
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆26Jan 4, 2024Updated 2 years ago
- ☆16Dec 9, 2024Updated last year
- PyTorch implementations of neural network models for keyword spotting☆11Oct 19, 2020Updated 5 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- A Python toolbox for deepfake detection☆29Oct 12, 2020Updated 5 years ago
- WebAssembly port of Rhubarb Lip Sync - an advanced lip sync tool that automatically creates mouth animation from audio files. Perfect for…☆27Sep 3, 2025Updated 8 months ago
- Logo detection in images using SSD☆10Jul 13, 2018Updated 7 years ago
- Visualization tools for audio-only and multi-modal speaker diarization dataset☆13Oct 27, 2023Updated 2 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆17May 14, 2022Updated 3 years ago
- Forced alignment decoder for Whisper.☆15Mar 13, 2024Updated 2 years ago
- ☆12Sep 1, 2021Updated 4 years ago
- the xelatex version of the Blog of Prof. Xinlong Wang: http://xlwangnu.blog.163.com/☆11Apr 3, 2024Updated 2 years ago
- Generate and translate a SubRip file from a video using Watson Speech to Text and Globalization Pipeline.☆17Oct 17, 2017Updated 8 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch.☆12Oct 4, 2020Updated 5 years ago
- Automatic Arabic diacritics restoration tool.☆18Aug 12, 2021Updated 4 years ago
- ☆18Jan 10, 2024Updated 2 years ago
- ☆14Sep 17, 2022Updated 3 years ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆19Nov 30, 2024Updated last year
- Embedded Tajweed annotation for the Qur'an☆11Nov 30, 2025Updated 5 months ago
- Combines Apify's crawling system and article parsing with unfluff library.☆12Jul 10, 2024Updated last year
- Convert LaBSE model from TF Hub to PyTorch.☆15Jan 15, 2026Updated 3 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆23Feb 24, 2022Updated 4 years ago
- Repository to document results of a Tacotron 2 adaptation for Brazilian Portuguese.☆17Sep 8, 2022Updated 3 years ago
- Original VinVL visual backbone with simplified APIs to easily extract features, boxes, object detections, in a few lines of Python code.☆12Nov 27, 2022Updated 3 years ago
- Compares three CTC decoders: greedy decoder, beam decoder, and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- Collection of various MATLAB functions for spatial audio processing released by the 3D3A Lab at Princeton University☆18Dec 19, 2023Updated 2 years ago