We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networks (CNN1D), and CNN2D in this repository. We undertake some basic data preprocessing and feature extraction on audio sources before developing models. As a result, the accuracy, training time, and prediction ti…
☆61Mar 8, 2022Updated 4 years ago
Alternatives and similar repositories for Audio-Classification-Deep-Learning
Users that are interested in Audio-Classification-Deep-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egy…☆18Mar 9, 2026Updated 3 months ago
- Environmental sound classification with Convolutional neural networks and the UrbanSound8K dataset.☆74Apr 25, 2021Updated 5 years ago
- MIMII Sound Anomaly Detection with AutoEncoders☆40Jul 9, 2021Updated 4 years ago
- This repository contains a YoloV4/Darknet based image classifier coded to run onboard the Nvidia Jetson Nano platform at approximately 10…☆14Aug 17, 2021Updated 4 years ago
- Mendeteksi bahasa isyarat alfabet tangan dalam format sibi☆17Jan 31, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Creating a Yoga pose classification using Mediapipe with help of OpenCV☆20Sep 13, 2022Updated 3 years ago
- End-to-End Arabic ASR using DeepSpeech engine☆14Nov 2, 2021Updated 4 years ago
- Audio feature extraction and multi-classification with the ECS-10 data set☆21Jun 7, 2018Updated 8 years ago
- Natural Language processing in tensorflow☆15Apr 11, 2022Updated 4 years ago
- Classification of audio signals using PyTorch☆13May 19, 2020Updated 6 years ago
- This is my PyTorch implementation of the "Very Deep Convolutional Neural Networks For Raw Waveforms" research paper published in 2016.☆17Aug 24, 2021Updated 4 years ago
- Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks☆107Aug 2, 2024Updated last year
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆43Dec 14, 2022Updated 3 years ago
- ☆17Sep 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- this is complete crud api with auth jwt token☆11Jan 21, 2022Updated 4 years ago
- Speech Dereverberation using weighted prediction error☆11Dec 22, 2019Updated 6 years ago
- This repository contains the revised version of the NLP lab at HCMUT. The lab is designed to help students understand the basic concepts …☆20Jan 11, 2025Updated last year
- jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2☆19Aug 15, 2025Updated 10 months ago
- Project developed for the Computer Vision course unit in FEUP☆10Apr 27, 2021Updated 5 years ago
- Dynamic Time Warping algorithm for the Physionet Challenge 2016☆17Nov 10, 2016Updated 9 years ago
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling☆21Aug 4, 2024Updated last year
- Python implementation of PayNow QR Code Generator☆20Dec 2, 2022Updated 3 years ago
- Quy Nhon AI Hackathon 2022 - Challenge 2: Review Analytics - Top 1 Solution☆11Sep 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Dec 15, 2023Updated 2 years ago
- Building a Linear Heatmap for Routes☆11Nov 2, 2023Updated 2 years ago
- Dotnet core static blog generator based on markdown, YAML front matter and Handlebars.NET☆11Apr 5, 2020Updated 6 years ago
- It's an end-to-end Machine Learning Project. The prediction has been done by using Machine Learning (ML) classification algorithms and it…☆12Nov 26, 2022Updated 3 years ago
- This project leverages YOLO (You Only Look Once) to detect various traffic violations in real-time, aimed at improving road safety and co…☆19Feb 12, 2025Updated last year
- Arabic Text to Speech☆18Jun 3, 2015Updated 11 years ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆19Apr 27, 2026Updated 2 months ago
- ☆32Oct 29, 2024Updated last year
- A course in numerical methods with Python for engineers and scientists: currently 5 learning modules, with student assignments.☆10Dec 6, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- VachanaTTS เป็นเครื่องมือ Text-to-Speech (TTS) ที่ใช้โมเดล VITS สำหรับสร้างเสียงพูดจากข้อความในภ าษาไทย,รองรับการโคลนเสียง,พอดแคสต์,การพาก…☆26Sep 14, 2025Updated 9 months ago
- Implementation of USAD (UnSupervised Anomaly Detection on multivariate time series) in PyTorch Lightning☆21Oct 15, 2021Updated 4 years ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆15May 30, 2023Updated 3 years ago
- This is the RobEn AI's team home made Discord bot. Custom made for the AI team Discord server, to serve.☆16May 26, 2021Updated 5 years ago
- ☆10Jul 16, 2025Updated 11 months ago
- GA Project 5 (Capstone Project): Using Neural Networks (BERT) with Legal NLP for Contract Clause Classification in real-life clauses☆14Aug 17, 2020Updated 5 years ago
- Code for YouTube series: Deep Learning for Audio Classification☆588Feb 6, 2023Updated 3 years ago