A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.
☆27Nov 18, 2021Updated 4 years ago
Alternatives and similar repositories for gender-audio-classification
Users that are interested in gender-audio-classification are comparing it to the libraries listed below
Sorting:
- Machine learning experiment to perform gender classification from raw audio.☆23Sep 1, 2018Updated 7 years ago
- Using machine learning to recognise gender by analysing recorded voice.☆12Nov 7, 2025Updated 4 months ago
- Audio MNIST Classification using 1D-CNN, 2D-CNN, GAN+2D-CNN, CVN+RandomForest, and LSTMs.☆14Dec 7, 2021Updated 4 years ago
- Multi-class audio classification with MFCC features using CNN☆31Jan 4, 2020Updated 6 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆221Jul 6, 2023Updated 2 years ago
- ☆35Nov 19, 2023Updated 2 years ago
- This project is to develop a named entity recognition (NER) model to identity medical entities such as diseases, symptoms, treatments in…☆12Oct 15, 2024Updated last year
- XCORE-VOICE Solution☆17Jun 12, 2025Updated 8 months ago
- 利用 python2.7 + OpenCV2.4 + django 1.9 + xamdin完成的一个基于图像搜索的系统。利用 OpenCV sift算法提取图像特征点 进行图像匹配。☆12Dec 8, 2022Updated 3 years ago
- D&M Landing Page Engine - OpenSource PHP landing page engine/constructor to create landing pages with dynamic content☆10May 19, 2017Updated 8 years ago
- Audio classification via transfer learning☆35Oct 3, 2019Updated 6 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- TypeScript SDK for programmatic access to Google NotebookLM☆28Jan 14, 2026Updated last month
- ☆12Nov 12, 2024Updated last year
- [AAAI'23] FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction https://arxiv.org/abs/2304.00902☆10Apr 9, 2023Updated 2 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- ☆12Sep 22, 2022Updated 3 years ago
- ☆11Sep 26, 2022Updated 3 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- pytorch+bert实现的意图识别与槽位填充☆11May 30, 2023Updated 2 years ago
- Processing ECG Signal, QRS and ST Segment Detection, BPM Calculation, ST Slope Measurement and Myocardial Ischemia Detection.☆12Jun 27, 2020Updated 5 years ago
- Small extensions of the Bellman-Ford routines in NetworkX, primarily for convenience☆13May 7, 2018Updated 7 years ago
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction, Computer Visi…☆21Sep 2, 2025Updated 6 months ago
- [COLING 2024] SentiCSE: A Sentiment-aware Contrastive Sentence Embedding Framework with Sentiment-guided Textual Similarity☆13May 8, 2024Updated last year
- 这是一个深度学习的一个小项目,利用卷积神经网络识别猫狗图片☆21May 5, 2022Updated 3 years ago
- Use `outlines` generators with Haystack.☆15Mar 3, 2026Updated last week
- ERP Desktop App base on Flask & Electron☆11Jun 13, 2022Updated 3 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- ☆13Feb 8, 2017Updated 9 years ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated 11 months ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- ☆10Jan 20, 2024Updated 2 years ago
- Manage mikrotik devices☆14Jan 24, 2023Updated 3 years ago
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- Neural machine translation with Recurrent Deterministic Policy Gradient☆10Aug 18, 2016Updated 9 years ago
- ☆10Apr 18, 2022Updated 3 years ago