andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆56 · Jul 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for Mockingjay-Speech-Representation
Users that are interested in Mockingjay-Speech-Representation are comparing it to the libraries listed below
- Enable RNNLM lattice rescoring with Pytorch [kaldi] ☆12 · Jun 5, 2020 · Updated 5 years ago
- ☆10 · Sep 19, 2022 · Updated 3 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆104 · Nov 26, 2022 · Updated 3 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT ☆40 · Aug 29, 2024 · Updated last year
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 · Jul 22, 2021 · Updated 4 years ago
- COLA contrastive pre-training method implemented in PyTorch ☆43 · Jan 27, 2021 · Updated 5 years ago
- Official Implementation of SERIL in Pytorch ☆27 · Sep 29, 2020 · Updated 5 years ago
- Feature extractor for DL speech processing. ☆66 · Apr 13, 2022 · Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆52 · Dec 6, 2022 · Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit. ☆10 · Feb 29, 2024 · Updated last year
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications. ☆21 · Dec 20, 2023 · Updated 2 years ago
- ☆22 · Aug 21, 2020 · Updated 5 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit ☆2,527 · Jun 13, 2025 · Updated 8 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code. ☆68 · Jan 5, 2026 · Updated last month
- A Streamlit app to add structured tags to a dataset card ☆22 · Jun 30, 2022 · Updated 3 years ago
- Code: Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod… ☆25 · Dec 17, 2019 · Updated 6 years ago
- Evaluation of a number of loudness meter implementations ☆12 · Aug 28, 2021 · Updated 4 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr… ☆11 · Dec 19, 2025 · Updated last month
- ☆13 · Sep 25, 2024 · Updated last year
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks". ☆10 · Dec 2, 2024 · Updated last year
- Digital Signals Theory book and source materials ☆33 · Jan 7, 2026 · Updated last month
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Siamese network for unsupervised speech representation learning ☆11 · Oct 12, 2018 · Updated 7 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation ☆14 · Nov 16, 2020 · Updated 5 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch. ☆27 · Apr 20, 2024 · Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer ☆90 · Jun 9, 2022 · Updated 3 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining ☆12 · Mar 23, 2021 · Updated 4 years ago
- An implementation of the Wav2Letter Speech-to-Text model using PyTorch. ☆14 · Mar 8, 2023 · Updated 2 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini… ☆13 · Mar 18, 2024 · Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning ☆16 · Jun 23, 2024 · Updated last year
- Zerospeech Challenge 2021: validation and evaluation software ☆12 · Jun 13, 2022 · Updated 3 years ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin… ☆63 · May 19, 2023 · Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆345 · May 15, 2024 · Updated last year
- BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation ☆227 · Apr 26, 2023 · Updated 2 years ago
- Training and evaluation code for Re-MOVE models with embedding distillation ☆31 · Jul 6, 2023 · Updated 2 years ago
- ☆14 · Mar 25, 2023 · Updated 2 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆15 · Jun 11, 2024 · Updated last year
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee… ☆17 · Sep 19, 2023 · Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Jun 2, 2023 · Updated 2 years ago