Predict the speaker's gender from an audio file (Flask API included)
☆20May 1, 2023Updated 2 years ago
Alternatives and similar repositories for Gender-Recognition-by-Voice
Users that are interested in Gender-Recognition-by-Voice are comparing it to the libraries listed below.
- ☆18Nov 13, 2021Updated 4 years ago
- Official code repository for ICCV 2021 paper: Gravity-Aware Monocular 3D Human Object Reconstruction☆17Oct 12, 2021Updated 4 years ago
- Standard libraries for audio processing, especially STFT and Spherical Harmonics decomposition of a soundfield.☆10Nov 29, 2021Updated 4 years ago
- A small song with Remotion + Tune.JS☆10Feb 23, 2024Updated 2 years ago
- A toy Text-to-Speech system for Chinese/Mandarin speech synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.☆15May 25, 2022Updated 3 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- SinGlow is part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use …☆11Oct 9, 2021Updated 4 years ago
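- The 不挂科AI backend is a web application built on the FastAPI framework that provides a range of intelligent services, including video-to-PPT conversion, PPT-to-PDF conversion, PDF and PPT content parsing, exam key-point outline generation, question generation, and mind map generation. The backend uses several Python libraries, such as FastAPI, PyPDF2, pyt…☆16Oct 30, 2024Updated last year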
- ☆16Sep 30, 2023Updated 2 years ago
- ☆25Jun 25, 2021Updated 4 years ago
- The Jazz Trio Database is a dataset composed of about 45 hours of jazz performances annotated by an automated signal processing pipeline.☆13Sep 27, 2025Updated 6 months ago
- (Experimental) Predicting hand assignments in piano MIDI using neural networks☆13Oct 11, 2024Updated last year
- This is the implementation of the paper "VAW-GAN for Singing Voice Conversion with Non-parallel Training Data".☆17Aug 12, 2020Updated 5 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Oct 19, 2020Updated 5 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- macOS executable and app for piano transcription, which generates MIDI files from music using machine learning.☆11Nov 9, 2024Updated last year
- Logo detection in images using SSD☆10Jul 13, 2018Updated 7 years ago
- Visualization tools for audio-only and multi-modal speaker diarization datasets☆13Oct 27, 2023Updated 2 years ago
- ☆24Feb 15, 2020Updated 6 years ago
- ☆24Dec 10, 2023Updated 2 years ago
- ☆12Sep 1, 2021Updated 4 years ago
- The XeLaTeX version of the blog of Prof. Xinlong Wang: http://xlwangnu.blog.163.com/ ☆11Apr 3, 2024Updated last year
- Code supporting the ISMIR 2020 Klio Tutorial☆20Oct 11, 2020Updated 5 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implemented in PyTorch.☆12Oct 4, 2020Updated 5 years ago
- Automatic Arabic diacritics restoration tool.☆18Aug 12, 2021Updated 4 years ago
- Used to evaluate a WaveNet vocoder by RMSE of F0, MCD, RMSE of AP...☆15Jan 20, 2020Updated 6 years ago
- An ML Approach to Stem Separation and Music Transcription.☆16Dec 12, 2024Updated last year
- ☆18Jan 10, 2024Updated 2 years ago
- ☆18Feb 24, 2022Updated 4 years ago
- Embedded Tajweed annotation for the Qur'an☆11Nov 30, 2025Updated 3 months ago
- Combines Apify's crawling system and article parsing with the unfluff library.☆12Jul 10, 2024Updated last year
- Front-end for symbolic music AI models☆17Nov 20, 2025Updated 4 months ago
- Convert LaBSE model from TF Hub to PyTorch.☆16Jan 15, 2026Updated 2 months ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆23Feb 24, 2022Updated 4 years ago
- Piano performance visualizer.☆10Jun 8, 2022Updated 3 years ago
- 📚 Source code of 雨云百科. PRs are welcome, let's write it together!☆20Sep 26, 2025Updated 6 months ago
- Compares three CTC decoders: greedy decoder, beam decoder, and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- Real time multilingual face translator☆38Jul 26, 2025Updated 8 months ago
- Automatically exported from code.google.com/p/flow-tools☆13Jan 22, 2022Updated 4 years ago