Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurrent neural network (MTF-CRNN) for audio event detection. Our goal is to improve audio event detection performance and recognize target audio events that have different lengths and accompany the complex audio back…
☆23Apr 15, 2020Updated 5 years ago
Alternatives and similar repositories for MTF-CRNN
Users that are interested in MTF-CRNN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jun 2, 2019Updated 6 years ago
- ☆54Jun 3, 2020Updated 5 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 4 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 7 years ago
- ☆17Feb 14, 2020Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- Language modelling for sound event detection☆20Jan 2, 2020Updated 6 years ago
- Visualization toolbox for Sound Event Detection☆123Feb 26, 2024Updated 2 years ago
- ☆14Oct 2, 2017Updated 8 years ago
- ☆11Apr 20, 2020Updated 5 years ago
- SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations☆15Jul 27, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆16Apr 11, 2019Updated 6 years ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆393Jun 16, 2021Updated 4 years ago
- Predict prosody labels for Chinese sentences.☆42Jul 7, 2022Updated 3 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- new cycleGAN which has two discriminators: D_class and D_defect☆10Jun 19, 2019Updated 6 years ago
- Direction of arrival estimation algorithms in the spherical harmonics domain☆13Oct 3, 2018Updated 7 years ago
- Code for Yun Wang's PhD Thesis: Polyphonic Sound Event Detection with Weak Labeling☆169May 14, 2022Updated 3 years ago
- A low cost Arduino implementation of an AIS receiver displayed in radar like style on a Raspberry Pi Pico☆13May 15, 2023Updated 2 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆67Nov 28, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Oct 6, 2022Updated 3 years ago
- ☆13May 9, 2022Updated 3 years ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- Subband averaging kurtogram (SAK), incorporating with dual-tree complex wavelet packet transform (DTCWPT), to improve performance of the …☆12May 5, 2021Updated 4 years ago
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- 4th position solution to the MediaEval - The 2019 Emotion and Themes in Music using Jamendo☆15Nov 13, 2019Updated 6 years ago
- ☆20Apr 11, 2019Updated 6 years ago
- Some useful features of speech process, such as MFCC, gammatone filterbank, GFCC, spectrum(power spectrum and log-power spectrum), Amplit…☆129Aug 12, 2020Updated 5 years ago
- ☆30Nov 9, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official repo for the STRFNet system appeared in INTERSPEECH2020☆12Mar 6, 2021Updated 5 years ago
- Contains the implementation of the EDAIN and EDAIN-KL methods proposed in our paper. The research was also part of the thesis I wrote as …☆16Feb 19, 2024Updated 2 years ago
- This repository is webrtc agc module demo.☆12Jan 23, 2019Updated 7 years ago
- This is the code for EEGDnet: Fusing non-local and local self-similarity for EEG signal denoising with transformer☆17Jun 3, 2024Updated last year
- 整理出来的webrtc波束模块☆40Apr 7, 2021Updated 5 years ago
- Continual Learning Benchmark for Spoken Keyword Spotting☆17Jun 7, 2022Updated 3 years ago
- 基于DeepConvLSTM的传感器信号分类☆11May 15, 2018Updated 7 years ago