mispchallenge/MISP2021-AVSR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mispchallenge/MISP2021-AVSR)

mispchallenge / MISP2021-AVSR

repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"

☆18

Alternatives and similar repositories for MISP2021-AVSR

Users that are interested in MISP2021-AVSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mispchallenge / misp2021_baseline
View on GitHub
☆29Jun 15, 2022Updated 4 years ago
mispchallenge / misp2022_baseline
View on GitHub
☆33Jun 26, 2023Updated 3 years ago
mispchallenge / MISP-ICME-AVSR
View on GitHub
☆17Jan 1, 2024Updated 2 years ago
LUMIA-Group / Leveraging-Self-Supervised-Learning-for-AVSR
View on GitHub
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…
☆67Jul 13, 2022Updated 4 years ago
jqi41 / Gfcc
View on GitHub
Gammatone feature for robust speech recognition
☆14Aug 1, 2016Updated 9 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
chenzhehuai / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆13Jun 5, 2019Updated 7 years ago
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
jiay7 / wenet_onlinedecode
View on GitHub
Went online decode demo
☆31Apr 28, 2021Updated 5 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
View on GitHub
☆50Dec 26, 2020Updated 5 years ago
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
wq2012 / CurriculumVitae
View on GitHub
Curriculum Vitae of Quan Wang
☆15Updated this week
justinjohn0306 / TTS-TT2
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆11Mar 26, 2023Updated 3 years ago
qiuqiangkong / sampleRNN_acoustic_scene_generation
View on GitHub
☆14Apr 18, 2019Updated 7 years ago
sungnyun / cav2vec
View on GitHub
(ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
☆16Apr 29, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Mashiro009 / wenet-online-decoder-onnx
View on GitHub
☆40Aug 15, 2021Updated 4 years ago
zoezou2015 / abs_pretraining
View on GitHub
☆10Apr 28, 2021Updated 5 years ago
KawhiZhao / Egocentric-Audio-Visual-Speaker-Localization
View on GitHub
Code for paper Audio Visual Speaker Localization from EgoCentric Views
☆11Jul 3, 2024Updated 2 years ago
yufan-aslp / AliMeeting
View on GitHub
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆142Jun 10, 2022Updated 4 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
double22a / asr_nlp_paper_code
View on GitHub
Papers of ASR, Tools of ASR
☆41Feb 14, 2025Updated last year
funcwj / setk
View on GitHub
Tools for Speech Enhancement integrated with Kaldi
☆432Jul 6, 2023Updated 3 years ago
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago
idiap / torgo_asr
View on GitHub
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech
☆17Sep 22, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
talhanai / wer-sigtest
View on GitHub
Script to perform statistical significance test between ASR hypotheses.
☆23Aug 13, 2017Updated 8 years ago
suhaspillai / Speech-Recognition-Impaired-Speech
View on GitHub
Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…
☆23Mar 26, 2017Updated 9 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
bigcash / spleeter-pytorch-mnn
View on GitHub
convert spleeter pretrained model to pytorch and onnx, then convert to mnn
☆21Dec 17, 2020Updated 5 years ago
sony / audio-visual-seld-dcase2023
View on GitHub
Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge
☆68Mar 19, 2025Updated last year
Yuanshi9815 / LiteFocus
View on GitHub
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆34Mar 11, 2025Updated last year
jingyonghou / KWS_Max-pooling_RHE
View on GitHub
Mining effective negative training samples for keyword spotting (PyTorch)
☆66May 23, 2020Updated 6 years ago
stdereka / liverpool-ion-switching
View on GitHub
Liverpool Ion Switching kaggle competition 2nd place winning solution
☆16Mar 25, 2023Updated 3 years ago
whl97 / LS-Score
View on GitHub
☆15Nov 24, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stoneMo / SLAVC
View on GitHub
Official Codebase of "A Closer Look at Weakly-Supervised Audio-Visual Source Localization" (NeurIPS 2022)
☆22Dec 6, 2022Updated 3 years ago
OpenNLPLab / MMVAE-AVS
View on GitHub
Multimodal Variational Auto-encoder based Audio-Visual Segmentation [ICCV2023].
☆20Sep 19, 2024Updated last year
facebookresearch / MMCSG
View on GitHub
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41Mar 13, 2024Updated 2 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
l3das / L3DAS23
View on GitHub
Official repository supporting the L3DAS23 IEEE ICASSP Grand Challenge
☆16Feb 10, 2023Updated 3 years ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago