A python code based on pytorch applied to AudioClassification
☆48Jul 15, 2022Updated 3 years ago
Alternatives and similar repositories for Pytorch-AudioClassification-master
Users that are interested in Pytorch-AudioClassification-master are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE and other models, as well as a vari…☆595Dec 17, 2025Updated 5 months ago
- Python的音频工具☆16Dec 5, 2025Updated 5 months ago
- Pytorch code for "Rethinking CNN Models for Audio Classification"☆129Mar 25, 2021Updated 5 years ago
- 基于Tensorflow实现声音分类,博客地址:☆106May 8, 2020Updated 6 years ago
- Reinforcement Learning-based Generative Fixed-filter Active Noise Control☆19Sep 15, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TBSK (Trait Block Shift Keying) audio modem Communication Library for Python☆11Mar 26, 2024Updated 2 years ago
- ☆12Mar 30, 2023Updated 3 years ago
- ☆12Feb 25, 2024Updated 2 years ago
- ☆13Jun 13, 2023Updated 2 years ago
- [ICCV'23] PAINet: Parallel Attention Interaction Network for Few-shot Skeleton-based Action Recognition☆11Oct 14, 2023Updated 2 years ago
- Classification of Urban Sound Audio Dataset using LSTM-based model.☆77Oct 11, 2022Updated 3 years ago
- ☆11May 30, 2023Updated 2 years ago
- The official implementation of our work SQLFixAgent: Towards Semantic-Accurate Text-to-SQL Parsing via Consistency-Enhanced Multi-Agent C…☆24May 2, 2025Updated last year
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆84Jul 31, 2020Updated 5 years ago
- Wave U Net (NNabla)☆13Jul 1, 2020Updated 5 years ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".☆15Mar 7, 2022Updated 4 years ago
- Code repository for ‘Adaptive Differential Denoising for Respiratory Sounds Classification’☆22Dec 19, 2025Updated 5 months ago
- This repository collects papers related to Speech Tokenizer.☆18Oct 16, 2024Updated last year
- A Python wrapper for GGWave – a data-over-sound communication library.☆25Feb 25, 2025Updated last year
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Classification of animal sounds in a hyperdiverse rainforest using Convolutional Neural Networks (Sun et al, 2021)☆13Oct 16, 2023Updated 2 years ago
- The official implementation of ACL 2023 paper "Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification."☆22Jun 30, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆19May 31, 2025Updated 11 months ago
- A repository of the latest work related to underwater image enhancement (awaiting continuous updates). It provides relevant underwater im…☆22May 14, 2026Updated last week
- MVVM kotlin CC 组件化开发☆11Mar 13, 2020Updated 6 years ago
- We have implemented Track # 1 for ICME 2024: Spatial Action Localization on Chaotic World dataset. Our mAP on the validation set reaches …☆12Nov 11, 2024Updated last year
- AI Music Structure Analyzer + Stem Splitter using Demucs & Mdx-Net with Python-Audio-Separator | Cog | Replicate☆13Mar 3, 2024Updated 2 years ago
- ☆22Jul 16, 2024Updated last year
- Official implementation of Hierarchical Spectrogram Transformers (HST)☆20Oct 10, 2022Updated 3 years ago
- Diffusers++: State-of-the-art diffusion models for image and audio generation in PyTorch☆14Sep 18, 2024Updated last year
- A python implementation of “Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization” [TASLP 2021]☆27Feb 11, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆17Jul 15, 2024Updated last year
- real-time speech enhance☆17Jan 23, 2024Updated 2 years ago
- In this project, I used a deep neural network (built with Keras) to clone car driving behavior. The dataset used to train the network is …☆14Aug 6, 2018Updated 7 years ago
- ☆15Dec 22, 2021Updated 4 years ago
- [ITSC'25] LLM-Guided Evaluation and Adversarial Generation of Safety-Critical Driving Scenarios☆27Aug 29, 2025Updated 8 months ago
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 9 months ago
- Use AI to edit image in Claude Desktop / Cursor (AI P图)☆18Mar 19, 2025Updated last year