andi611/Mockingjay-Speech-Representation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/andi611/Mockingjay-Speech-Representation)

andi611 / Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

☆55

Alternatives and similar repositories for Mockingjay-Speech-Representation

Users that are interested in Mockingjay-Speech-Representation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CVxTz / COLA_pytorch
View on GitHub
COLA contrastive pre-training method implemented in PyTorch
☆44Jan 27, 2021Updated 5 years ago
sungnyun / ARMHuBERT
View on GitHub
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆41Aug 29, 2024Updated last year
awslabs / speech-representations
View on GitHub
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104Nov 26, 2022Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
bepierre / SpeechVGG
View on GitHub
Feature extractor for DL speech processing.
☆66Apr 13, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
s3prl / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557Mar 12, 2026Updated 4 months ago
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
koudounasalkis / Audio-Speech-Tutorial
View on GitHub
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
☆19Dec 20, 2023Updated 2 years ago
AlanBaade / MAE-AST-Public
View on GitHub
Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
☆93Jun 9, 2022Updated 4 years ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
yistLin / human-evaluation
View on GitHub
☆23Aug 21, 2020Updated 5 years ago
gallipoligiuseppe / TST-CycleGAN
View on GitHub
This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".
☆11Dec 2, 2024Updated last year
K-STMLab / SSL4PR
View on GitHub
This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…
☆12Dec 19, 2025Updated 7 months ago
gary083 / GAN_Harmonized_with_HMMs
View on GitHub
Code：Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…
☆25Dec 17, 2019Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
etzinis / biased_separation
View on GitHub
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Nov 16, 2020Updated 5 years ago
dayanavivolab / s3prl
View on GitHub
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
maxencemayrand / beat-tracking
View on GitHub
Improving beat tracking algorithms with recurrent neural networks.
☆11Jan 7, 2019Updated 7 years ago
ChangLee0903 / SERIL
View on GitHub
Official Implementation of SERIL in Pytorch
☆27Sep 29, 2020Updated 5 years ago
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
GATECH-EIC / S3-Router
View on GitHub
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Sep 19, 2023Updated 2 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
View on GitHub
☆32Dec 2, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Alexander-H-Liu / NPC
View on GitHub
Non-Autoregressive Predictive Coding
☆51Nov 3, 2020Updated 5 years ago
qiujiali / lattice_rnn
View on GitHub
Bi-directional Lattice Recurrent Neural Networks for Confidence Estimation
☆15Aug 28, 2020Updated 5 years ago
koudounasalkis / AI4Voice
View on GitHub
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15Jun 11, 2024Updated 2 years ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
nttcslab / byol-a
View on GitHub
BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
☆237Apr 26, 2023Updated 3 years ago
SunnyCYC / drum-aware4beat
View on GitHub
☆11Dec 17, 2025Updated 7 months ago
huggingface / datasets-tagging
View on GitHub
A Streamlit app to add structured tags to a dataset card
☆23Jun 30, 2022Updated 4 years ago
pzelasko / kaldialign
View on GitHub
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70Jun 15, 2026Updated last month
pariajm / e2e-asr-and-disfluency-removal-evaluator
View on GitHub
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19Mar 7, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
ga642381 / Spoken-Dialogue-Model-Survey
View on GitHub
A survey of spoken dialogue models (SDMs) with speech input and speech output. Focus on their Intermediate Representation and Generation …
☆31Mar 24, 2026Updated 4 months ago
JinhuaLiang / lam4fsl
View on GitHub
An official repo for the paper "Adapting Language-Audio Models as Few-Shot Audio Learners"
☆31May 31, 2023Updated 3 years ago
jimbozhang / xares-llm-template
View on GitHub
Template for creating audio encoders compatible with X-ARES
☆19Feb 11, 2026Updated 5 months ago
bootphon / abnet3
View on GitHub
Siamese network for unsupervised speech representation learning
☆11Oct 12, 2018Updated 7 years ago
felixgontier / dcase-2023-baseline
View on GitHub
☆14Mar 25, 2023Updated 3 years ago
SunnyCYC / plpdp4beat
View on GitHub
☆11Jan 2, 2025Updated last year
csteinmetz1 / pyloudnorm-eval
View on GitHub
Evaluation of a number of loudness meter implementations
☆13Aug 28, 2021Updated 4 years ago