marc-moreaux / audioset_rawView external linksLinks
Download and create a tfreader for the audioset dataset
☆16Apr 16, 2020Updated 5 years ago
Alternatives and similar repositories for audioset_raw
Users that are interested in audioset_raw are comparing it to the libraries listed below
Sorting:
- This repository contains code that was used as an example of how to use Python to download part of the AudioSet dataset and use Tensorflo…☆13Aug 24, 2017Updated 8 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Siamese network for unsupervised speech representation learning☆11Oct 12, 2018Updated 7 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- Conferencing Speech Challenge☆95Apr 6, 2021Updated 4 years ago
- Subband PCA feature calculation☆16Nov 5, 2018Updated 7 years ago
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Sep 18, 2020Updated 5 years ago
- The source code for target sound detection☆15Feb 26, 2022Updated 3 years ago
- Phoneme recognizer based on long temporal context (with ALIZE VAD command added)☆17Apr 7, 2012Updated 13 years ago
- Demo for Neural Spatio-Temporal Beamformer for Target Speech Separation accepted to INTERSPEECH2020☆16Oct 20, 2020Updated 5 years ago
- Automatic Dialect Detection Repository☆39Nov 13, 2022Updated 3 years ago
- Bachelor's thesis carried at Universitat Politecnica de Catalunya in partial fullfilment of the requirements for the degree in Telecommun…☆16Jul 25, 2024Updated last year
- ☆17Feb 14, 2020Updated 6 years ago
- Dialect identification using Siamese network☆15Dec 12, 2017Updated 8 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Mar 7, 2021Updated 4 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Oct 8, 2018Updated 7 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- ☆47Aug 30, 2024Updated last year
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆42Jun 3, 2025Updated 8 months ago
- ☆22Mar 22, 2017Updated 8 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24May 14, 2021Updated 4 years ago
- A spoken question answering dataset on SQUAD☆50May 3, 2025Updated 9 months ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year