RicherMans / AudioCaption
Dataset and baseline for the first Audiocaption task
☆79Updated 8 months ago
Alternatives and similar repositories for AudioCaption:
Users that are interested in AudioCaption are comparing it to the libraries listed below
- ☆36Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆103Updated 2 years ago
- E2E-SincNet: Toward fully end-to-end speech recognition☆30Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- Non-Autoregressive Predictive Coding☆50Updated 4 years ago
- ☆53Updated 4 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆38Updated 4 years ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated 3 months ago
- ☆68Updated 3 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 3 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆141Updated last year
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Updated 2 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 5 years ago
- A list of resources that can help in research for automated audio captioning☆34Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆59Updated 2 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- A list of papers about audio captioning☆77Updated 2 years ago
- **Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…☆99Updated last week
- Alignment files of LibriTTS.☆61Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- A collection of papers related to speech model compression☆24Updated last year
- Implementation of CTC alignment-based single step non-autoregressive transformer☆13Updated last year
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Updated 4 years ago