pooya-mohammadi / audio-classification-pytorchView external linksLinks
In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.
☆43Jan 11, 2025Updated last year
Alternatives and similar repositories for audio-classification-pytorch
Users that are interested in audio-classification-pytorch are comparing it to the libraries listed below
Sorting:
- ☆25Aug 2, 2022Updated 3 years ago
- pytorch lightening image classification☆17Dec 29, 2022Updated 3 years ago
- presenting codes and notes from Elasticsearch 7.0 Cookbook over 100 recipes for fast, scalable, and reliable search for your enterprise, …☆13May 27, 2022Updated 3 years ago
- ☆15Dec 10, 2021Updated 4 years ago
- The deep_utils' notebooks are stored in this repository☆19Jun 27, 2022Updated 3 years ago
- In this repository, a complete fine-tuning process of HooshvareLab/roberta-fa-zwnj-base over ArmanPers dataset using Transformers library…☆13Apr 26, 2022Updated 3 years ago
- A simple OCR labeling tool built on Flask and PyQt5☆24Dec 6, 2022Updated 3 years ago
- This repo contains a series of notes for using Git and GitHub. It's aimed at making it easy to work with Git and GitHub practically. I al…☆30Dec 21, 2022Updated 3 years ago
- In this repository, I aim at providing theoretical and practical notes for fully understanding Yolo models. Then, I show how to label a d…☆11Aug 22, 2022Updated 3 years ago
- Today I learned contains Q&A of what I have learned so far. It encompasses topics like: python, deep-learning, cuda-installation, docker,…☆23Jan 25, 2026Updated 3 weeks ago
- This is a simple dockerfile for running tflite without installing TensorFlow☆17Dec 26, 2021Updated 4 years ago
- A simple hello-world project using kubernetes on minicube☆13Feb 16, 2022Updated 4 years ago
- ☆26May 5, 2022Updated 3 years ago
- ☆20Sep 18, 2021Updated 4 years ago
- In this repository, I share codes of the introduction to python courses published on my YouTube channel☆42Dec 23, 2022Updated 3 years ago
- A complete instruction for training a Persian spell checker and a language model based on SymSpell and KenLM, respectively using Wikipedi…☆35Jul 20, 2022Updated 3 years ago
- pytorch implementation of crnn. A sample training of license plate is provided.☆47Oct 14, 2023Updated 2 years ago
- Converting Vox files to Wav or other formats that are easy to workaround.☆24Dec 23, 2021Updated 4 years ago
- This is Pooya Mohammadi, Open Source Enthusiast, AI Developer & Researcher☆27Nov 11, 2024Updated last year
- added session 02☆20Feb 15, 2020Updated 6 years ago
- Advanced Deep Learning☆35Dec 23, 2021Updated 4 years ago
- This is a minimal implementation of face-detection models using flask, gunicorn, nginx, docker, and docker-compose☆38Sep 12, 2022Updated 3 years ago
- ☆24Mar 25, 2020Updated 5 years ago
- U-Net-based Models for Skin Lesion Segmentation: More Attention and Augmentation☆34Apr 16, 2025Updated 10 months ago
- An open-source toolkit which is full of handy functions, including the most used models and utilities for deep-learning practitioners!☆113Updated this week
- Introduction to Deep Learning☆25Oct 2, 2019Updated 6 years ago
- Visualizing Yolov5's layers using GradCam☆296Nov 27, 2023Updated 2 years ago
- A web app built with Flask to stream video feed from any IP camera to the users of a local network☆14Dec 23, 2021Updated 4 years ago
- Sharif Emotional Speech Database☆39Jan 9, 2021Updated 5 years ago
- From Claims to Evidence: A Unified Framework and Critical Analysis of CNN vs. Transformer vs. Mamba in Medical Image Segmentation.☆21Sep 11, 2025Updated 5 months ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆20Jul 27, 2024Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆28Oct 25, 2024Updated last year
- A large-scale validated database for Persian speech emotion detection.☆24May 9, 2022Updated 3 years ago
- Code release for TexFit: Text-Driven Fashion Image Editing with Diffusion Models (AAAI 2024)☆29Sep 30, 2024Updated last year
- Tacotron 2 - Persian☆37Dec 28, 2021Updated 4 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆50Sep 2, 2025Updated 5 months ago
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆37Jul 24, 2023Updated 2 years ago
- Django React Integration with Session Authentication, CORS, CSRF Mechanism & Cookies Handling.☆10May 9, 2021Updated 4 years ago