zhihanyang2022 / gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
☆27Nov 18, 2021Updated 4 years ago
Alternatives and similar repositories for gender-audio-classification
Users who are interested in gender-audio-classification are comparing it to the repositories listed below
- Machine learning experiment to perform gender classification from raw audio.☆23Sep 1, 2018Updated 7 years ago
- Audio MNIST Classification using 1D-CNN, 2D-CNN, GAN+2D-CNN, CVN+RandomForest, and LSTMs.☆14Dec 7, 2021Updated 4 years ago
- Multi-class audio classification with MFCC features using CNN☆31Jan 4, 2020Updated 6 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆220Jul 6, 2023Updated 2 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Jul 21, 2018Updated 7 years ago
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated last year
- Find bimodal, unimodal, and multimodal features in your data☆27Oct 26, 2018Updated 7 years ago
- D&M Landing Page Engine - OpenSource PHP landing page engine/constructor to create landing pages with dynamic content☆10May 19, 2017Updated 8 years ago
- This project is to develop a named entity recognition (NER) model to identify medical entities such as diseases, symptoms, treatments in…☆12Oct 15, 2024Updated last year
- XCORE-VOICE Solution☆17Jun 12, 2025Updated 8 months ago
- TypeScript SDK for programmatic access to Google NotebookLM☆23Jan 14, 2026Updated last month
- Audio classification via transfer learning☆35Oct 3, 2019Updated 6 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- Feature extraction from sound signals along with complete CNN model and evaluations using tensorflow, keras, and librosa for MFCC generat…☆10Jan 1, 2022Updated 4 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- ☆12Nov 12, 2024Updated last year
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Intent recognition and slot filling implemented with pytorch+bert☆11May 30, 2023Updated 2 years ago
- ☆10Oct 16, 2025Updated 4 months ago
- WebRTC based video conferencing SDK for iOS (Swift / Objective C)☆13Jan 27, 2026Updated 3 weeks ago
- Leverage 3D video and Spatial Audio to deliver an immersive experience.☆11Oct 11, 2023Updated 2 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- For personal use: speech-to-text uses sencevoice, the LLM part is based on ollama API calls, text-to-speech uses cosyvoice, and real-time voice input follows https://github.com/ABexit/ASR-LLM-TTS.☆12Dec 26, 2024Updated last year
- ☆11Oct 1, 2021Updated 4 years ago
- ☆10Apr 18, 2022Updated 3 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi…☆21Sep 2, 2025Updated 5 months ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated 11 months ago
- Processing ECG Signal, QRS and ST Segment Detection, BPM Calculation, ST Slope Measurement and Myocardial Ischemia Detection.☆12Jun 27, 2020Updated 5 years ago
- Deep learning application for predicting ocean wave behaviors.☆15May 31, 2020Updated 5 years ago
- 🎭 Official code and dataset for our CCGPK@COLING 2022 paper - "PersonaChatGen: Generating Personalized Dialogue using GPT-3"☆13Mar 26, 2024Updated last year
- YOLOv12 TensorRT end-to-end accelerated model inference and INT8 quantization implementation☆13Mar 5, 2025Updated 11 months ago
- This work proposes a speech emotion recognition model based on four different features extracted from RAVDESS sound fil…☆10Feb 27, 2022Updated 3 years ago
- An experiment with movie scenes and contrastive learning☆11Feb 1, 2025Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆13Feb 8, 2017Updated 9 years ago
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year