Sindhu-Hegde/pseudo-visual-speech-denoising

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Sindhu-Hegde/pseudo-visual-speech-denoising)

Sindhu-Hegde / pseudo-visual-speech-denoising

Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021

☆108

Alternatives and similar repositories for pseudo-visual-speech-denoising

Users that are interested in pseudo-visual-speech-denoising are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sid2697 / Word-recognition-EmbedNet-CAB
View on GitHub
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
☆21May 21, 2021Updated 5 years ago
prajwalkr / transpotter
View on GitHub
Official implementation of Transpotter, published in BMVC 2021
☆16Aug 6, 2022Updated 3 years ago
Rudrabha / LipGAN
View on GitHub
This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Transla…
☆616Jun 22, 2025Updated last year
Sindhu-Hegde / gestsync
View on GitHub
Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023
☆48Sep 1, 2024Updated last year
Rudrabha / 8X-Super-Resolution
View on GitHub
This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…
☆16Aug 26, 2020Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
karanjakhar / yolov5-export-to-cpu
View on GitHub
Export yolov5 model to run on cpu using tflite
☆14Aug 12, 2021Updated 4 years ago
Rudrabha / Lip2Wav
View on GitHub
This is the repository containing codes for our CVPR, 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech S…
☆713Jul 6, 2023Updated 3 years ago
vskadandale / vocalist
View on GitHub
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
☆73Apr 7, 2024Updated 2 years ago
prajwalkr / transpeller
View on GitHub
Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022.
☆12Jun 22, 2023Updated 3 years ago
danmic / av-se
View on GitHub
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆222Apr 16, 2023Updated 3 years ago
MiuLab / SpokenVec
View on GitHub
Learning ASR-Robust Contextualized Embeddings for Spoken Language Understanding
☆24Dec 8, 2022Updated 3 years ago
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
deepakbaby / se_relativisticgan
View on GitHub
Keras framework for speech enhancement using relativistic GANs
☆52Jun 24, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
avijit9 / CleanAdapt
View on GitHub
Code for our Source-free Unsupervised Video Domain Adaptation Paper
☆13Jan 17, 2025Updated last year
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
georgesterpu / Taris
View on GitHub
Transformer-based online speech recognition system with TensorFlow 2
☆26Jan 22, 2021Updated 5 years ago
lusensama / Obamanet_retrain
View on GitHub
ObamaNet fork
☆12Sep 16, 2019Updated 6 years ago
kapoorparul / Towards-Automatic-Speech-to-SL
View on GitHub
☆17Oct 15, 2021Updated 4 years ago
sayandebroy-csmi / cleanadapt
View on GitHub
Reproduced code for Overcoming Label Noise for Source-free Unsupervised Video Domain Adaptation, ICVGIP'22
☆22Jun 8, 2024Updated 2 years ago
Tinglok / CVC
View on GitHub
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
☆58Jul 26, 2022Updated 4 years ago
Sindhu-Hegde / speaker-separation
View on GitHub
Code for the cocktail-party problem of isolating and enhancing the speech for the target speaker
☆18Mar 11, 2022Updated 4 years ago
Kazuhito00 / simple-virtual-mouse-using-mediapipe
View on GitHub
MediaPipeを用いたハンドジェスチャーによる簡単なマウス操作を行うプログラムです。
☆12Mar 17, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ms-dot-k / Visual-Audio-Memory
View on GitHub
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22Apr 11, 2022Updated 4 years ago
Sid2697 / HOI-Ref
View on GitHub
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30Apr 16, 2024Updated 2 years ago
tatsuyai713 / Image-Processing-Node-Editor-ROS2
View on GitHub
処理の検証や比較検討での用途を想定したノードエディターベースの画像処理アプリ
☆11Mar 5, 2023Updated 3 years ago
dtake1336 / ERNN-for-speech-enhancement
View on GitHub
☆38Jul 20, 2020Updated 6 years ago
linan2 / TensorFlow-speech-enhancement-Chinese
View on GitHub
基于深度学习的语音增强、去混响
☆102Jan 30, 2024Updated 2 years ago
michaelzhang-ai / Speech2Video
View on GitHub
ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"
☆100Feb 27, 2026Updated 4 months ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
fedden / TensorFlow-Efficient-Neural-Audio-Synthesis
View on GitHub
☆20Feb 27, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
JuanFMontesinos / Acappella-YNet
View on GitHub
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆18May 14, 2022Updated 4 years ago
amtsai96 / Learning-Lip-Sync-from-Audio
View on GitHub
Learning Lip Sync of Obama from Speech Audio
☆67Jul 29, 2020Updated 5 years ago
pingponglabs / FaceAnime
View on GitHub
☆10Apr 22, 2021Updated 5 years ago
georgesterpu / avsr-tf1
View on GitHub
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84Jul 10, 2020Updated 6 years ago
JuanFMontesinos / VoViT
View on GitHub
VoViT: Low Latency Graph-based Audio-Visual VoiceSeparation Transformer
☆35Mar 18, 2023Updated 3 years ago
Merterm / Modeling-Intensification-for-SLG
View on GitHub
Public repo for the paper: "Modeling Intensification for Sign Language Generation: A Computational Approach" by Mert Inan*, Yang Zhong*, …
☆14Mar 15, 2022Updated 4 years ago
uark-cviu / Right2Talk
View on GitHub
[ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach
☆20Aug 2, 2021Updated 4 years ago