This project provides several approaches for training and fine-tuning an audio gender recognition model. The code can be reused for any other audio classification task by changing the number of classes and the input dataset.
☆43Jan 11, 2025Updated last year
Alternatives and similar repositories for audio-classification-pytorch
Users who are interested in audio-classification-pytorch are comparing it to the libraries listed below.
- ☆12Nov 5, 2022Updated 3 years ago
- ☆11May 9, 2022Updated 4 years ago
- ☆13May 27, 2022Updated 4 years ago
- ☆25Aug 2, 2022Updated 3 years ago
- In this repository, I aim at providing theoretical and practical notes for fully understanding Yolo models. Then, I show how to label a d…☆11Aug 22, 2022Updated 3 years ago
- presenting codes and notes from Elasticsearch 7.0 Cookbook over 100 recipes for fast, scalable, and reliable search for your enterprise, …☆13May 27, 2022Updated 4 years ago
- A simple OCR labeling tool built on Flask and PyQt5☆24Dec 6, 2022Updated 3 years ago
- In this repository, a complete fine-tuning process of HooshvareLab/roberta-fa-zwnj-base over ArmanPers dataset using Transformers library…☆13Apr 26, 2022Updated 4 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- The deep_utils' notebooks are stored in this repository☆19Jun 27, 2022Updated 3 years ago
- A simple hello-world project using Kubernetes on minikube☆13Feb 16, 2022Updated 4 years ago
- Today I learned contains Q&A of what I have learned so far. It encompasses topics like: python, deep-learning, cuda-installation, docker,…☆23May 18, 2026Updated last week
- We present our facial expression recognition models for fer-2013 dataset☆24May 27, 2022Updated 4 years ago
- This is a simple dockerfile for running tflite without installing TensorFlow☆17Dec 26, 2021Updated 4 years ago
- A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedi…☆35Jul 20, 2022Updated 3 years ago
- Converting coco-like annotation json files to png masks☆26Jun 1, 2022Updated 3 years ago
- ☆20Sep 18, 2021Updated 4 years ago
- Converting Vox files to Wav or other formats that are easy to work with.☆23Dec 23, 2021Updated 4 years ago
- pytorch implementation of crnn. A sample training of license plate is provided.☆48Oct 14, 2023Updated 2 years ago
- ☆17Nov 3, 2021Updated 4 years ago
- In this repository, I share the code from the introduction to Python courses published on my YouTube channel☆42Dec 23, 2022Updated 3 years ago
- This is Pooya Mohammadi, Open Source Enthusiast, AI Developer & Researcher☆28Nov 11, 2024Updated last year
- This is a minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose☆38Sep 12, 2022Updated 3 years ago
- Advanced Deep Learning☆34Dec 23, 2021Updated 4 years ago
- ☆28Dec 29, 2022Updated 3 years ago
- Generative AI based image editing/inpainting made super easy to work with.☆20Nov 5, 2023Updated 2 years ago
- U-Net-based Models for Skin Lesion Segmentation: More Attention and Augmentation☆34Apr 16, 2025Updated last year
- Introduction to Deep Learning☆24Oct 2, 2019Updated 6 years ago
- Visualizing Yolov5's layers using GradCam☆297Nov 27, 2023Updated 2 years ago
- Sharif Emotional Speech Database☆39Jan 9, 2021Updated 5 years ago
- Will tidy your sass and scss☆12Jul 12, 2018Updated 7 years ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆21Jul 27, 2024Updated last year
- A web app built with Flask to stream video feed from any IP camera to the users of a local network☆14Dec 23, 2021Updated 4 years ago
- BibTeX style file for the Journal of the Acoustical Society of Japan☆11Jan 24, 2022Updated 4 years ago
- DNS utils module for bepass sdk, supporting doh, dot, dnscrypt and static hosts file like configuration☆16Jan 13, 2024Updated 2 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆31Oct 25, 2024Updated last year
- a deep learning method to detect 68 landmarks☆15Aug 1, 2018Updated 7 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 4 years ago