Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and recognize target audio events that have different lengths and accompany the complex audio back…
☆23Apr 15, 2020Updated 6 years ago
Alternatives and similar repositories for MTF-CRNN
Users that are interested in MTF-CRNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Jun 2, 2019Updated 6 years ago
- ☆55Jun 3, 2020Updated 5 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆14Sep 18, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 7 years ago
- Download and create a tfreader for the audioset dataset☆17Apr 16, 2020Updated 6 years ago
- Language modelling for sound event detection☆20Jan 2, 2020Updated 6 years ago
- Visualization toolbox for Sound Event Detection☆123Feb 26, 2024Updated 2 years ago
- ☆14Oct 2, 2017Updated 8 years ago
- SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations☆15Jul 27, 2024Updated last year
- ☆16Apr 11, 2019Updated 7 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆393Jun 16, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- Direction of arrival estimation algorithms in the spherical harmonics domain☆13Oct 3, 2018Updated 7 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆168May 14, 2022Updated 4 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Oct 6, 2022Updated 3 years ago
- ☆13May 9, 2022Updated 4 years ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- Subband averaging kurtogram (SAK), incorporating with dual-tree complex wavelet packet transform (DTCWPT), to improve performance of the …☆12May 5, 2021Updated 5 years ago
- Open rotating mechanical fault datasets (开源旋转机械故障数据集整理)☆22Aug 10, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- 4th position solution to the MediaEval - The 2019 Emotion and Themes in Music using Jamendo☆15Nov 13, 2019Updated 6 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆130Aug 12, 2020Updated 5 years ago
- ☆30Nov 9, 2018Updated 7 years ago
- Official repo for the STRFNet system appeared in INTERSPEECH2020☆12Mar 6, 2021Updated 5 years ago
- Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the thesis I wrote as …☆16Feb 19, 2024Updated 2 years ago
- This is the code for EEGDnet: Fusing non-local and local self-similarity for EEG signal denoising with transformer☆17Jun 3, 2024Updated last year
- This repository is webrtc agc module demo.☆12Jan 23, 2019Updated 7 years ago
- 整理出来的webrtc波束模块☆40Apr 7, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICASSP'22] Continual Learning Benchmark for Spoken Keyword Spotting☆17Jun 7, 2022Updated 3 years ago
- Emotion-Classification-by-EEG-DEAP-Dataset implemented in 2DCNNN-LSTM-1DCNN+GRU and the 1D_cnn+gru model gives the highest accuracy☆11May 26, 2023Updated 2 years ago
- ☆12May 30, 2019Updated 6 years ago
- ☆21Apr 11, 2019Updated 7 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆44Nov 10, 2021Updated 4 years ago
- ☆20May 13, 2019Updated 7 years ago
- Audio Novelty Detection☆14Nov 20, 2018Updated 7 years ago