multitel-ai / urban-sound-taggingView external linksLinks
1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context
☆16Dec 8, 2022Updated 3 years ago
Alternatives and similar repositories for urban-sound-tagging
Users that are interested in urban-sound-tagging are comparing it to the libraries listed below
Sorting:
- ☆12Oct 2, 2020Updated 5 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 4 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Dec 11, 2020Updated 5 years ago
- SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, accepted in ICASSP 2019☆18Feb 20, 2019Updated 6 years ago
- Download and create a tfreader for the audioset dataset☆16Apr 16, 2020Updated 5 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Dec 12, 2020Updated 5 years ago
- ☆20May 13, 2019Updated 6 years ago
- The python implementation for paper "Towards Discriminative Representation Learning for Speech Emotion Recognition" in IJCAI-2019☆23Aug 12, 2019Updated 6 years ago
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆22Feb 20, 2019Updated 6 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- Learning discriminative and robust time-frequency representations for environmental sound classification: Convolutional neural networks (…☆30Dec 19, 2019Updated 6 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- an tutorial implement of voice conversion using pytorch☆34Mar 30, 2018Updated 7 years ago
- Multiple Instance Learning for Sound Event Detection☆34Apr 20, 2018Updated 7 years ago
- ☆13Updated this week
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Deep learning and standard machine learning methods are developed and compared in classfying audio samples from microphones deployed abo…☆11Jan 17, 2020Updated 6 years ago
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆38Dec 16, 2024Updated last year
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Aug 22, 2023Updated 2 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- Pytorch code for the paper 'Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acousti…☆14Nov 12, 2020Updated 5 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Hybrid GAN (HiFi-WaveGAN) applied to footsteps sound effects☆12Jul 17, 2023Updated 2 years ago
- Semi-automated process to create an audiobook (m4b format) from markdown files.☆11Jan 12, 2017Updated 9 years ago
- VGG-Face CNN descriptor in PyTorch.☆38Jan 21, 2021Updated 5 years ago