voletiv/syncnet-in-keras

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voletiv/syncnet-in-keras)

voletiv / syncnet-in-keras

Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.

☆51

Alternatives and similar repositories for syncnet-in-keras

Users that are interested in syncnet-in-keras are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

joonson / syncnet_python
View on GitHub
Out of time: automated lip sync in the wild
☆895Apr 17, 2026Updated 3 months ago
amtsai96 / Learning-Lip-Sync-from-Audio
View on GitHub
Learning Lip Sync of Obama from Speech Audio
☆67Jul 29, 2020Updated 6 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
View on GitHub
Audio-Visual Speech Recognition using Deep Learning
☆61Nov 14, 2018Updated 7 years ago
stevel705 / Tacotron-2-keras
View on GitHub
Keras implementations of Tacotron-2
☆27Jan 22, 2021Updated 5 years ago
hipudding / pytorch-lightning
View on GitHub
Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.
☆13Feb 20, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Kajiyu / LLLNet
View on GitHub
Keras Implementation of "Look, Listen and Learn" Model
☆21Nov 14, 2017Updated 8 years ago
artem179 / WLAS
View on GitHub
The implementation of 'Watch, Listen, Attend and Spell’ (WLAS) network that learns to transcribe videos of mouth motion to character on p…
☆11Mar 23, 2018Updated 8 years ago
mayurnewase / looking-to-listen-at-cocktail-party
View on GitHub
Looking to listen at cocktail party
☆36Mar 24, 2023Updated 3 years ago
sasanasadiabadi / speech_animation
View on GitHub
☆24May 23, 2018Updated 8 years ago
dee-ex / aicovidvn115m
View on GitHub
Giải pháp của nhóm "đi thi", đạt được Hạng 3 vòng Về đích với AUC 0.92 trong cuộc thi AICovidVN115m
☆13Sep 6, 2021Updated 4 years ago
voletiv / lipreading-in-the-wild-experiments
View on GitHub
My experiments in lip reading using deep learning with the LRW dataset
☆54Mar 14, 2021Updated 5 years ago
acvictor / Obama-Lip-Sync
View on GitHub
An implementation of ObamaNet: Photo-realistic lip-sync from text.
☆127Apr 21, 2019Updated 7 years ago
jaekookang / p2fa_py3
View on GitHub
Penn Phonetics Lab Forced Aligner Toolkit (P2FA) for Python3
☆107Feb 27, 2024Updated 2 years ago
robertanto / bob_telegram_tools
View on GitHub
Bob Telegram Tools is a python library that allows you to monitor your machine learning methods just by using Telegram without any additi…
☆11Jul 10, 2020Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
VIPL-Audio-Visual-Speech-Understanding / VIPL-AVSU-Group
View on GitHub
Collection of works from VIPL-AVSU
☆50Jul 21, 2026Updated last week
spytensor / keras_image_classifier
View on GitHub
use keras to do image classification tasks
☆12Dec 29, 2018Updated 7 years ago
omelchert / optfrog
View on GitHub
Analytic signal spectrograms with optimized time-frequency resolution
☆10Oct 6, 2020Updated 5 years ago
joonson / yousaidthat
View on GitHub
You Said That?: Synthesising Talking Faces from Audio
☆70Apr 29, 2018Updated 8 years ago
mmaciej2 / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆13Jun 10, 2019Updated 7 years ago
cmetz / python-matrixio-hal
View on GitHub
Python Matrix Creator / Voice HAL
☆12May 10, 2018Updated 8 years ago
crystal-method / Looking-to-Listen
View on GitHub
☆40Jul 19, 2018Updated 8 years ago
rgrzeszi / bof-aed
View on GitHub
Bag-of-Features Acoustic Event Detection
☆14Oct 5, 2016Updated 9 years ago
IQTLabs / Audio-Sensor-Toolkit
View on GitHub
A guide and set of tools for working with TinyML powered Audio Sensors
☆20Sep 17, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
omoju / Fundamentals
View on GitHub
Computer Science, Data Science and ML Fundamentals
☆11May 30, 2025Updated last year
uzeful / VA_Project
View on GitHub
Cross-modality (visual-auditory) Metric Learning Project
☆15Dec 19, 2017Updated 8 years ago
deeplearningzhy / DL
View on GitHub
TensorFlow，DCGAN，VAE，LSTM，CNN，Acoustic Scene Classification
☆11Jun 5, 2019Updated 7 years ago
ovshake / cobra
View on GitHub
Code for COBRA: Contrastive Bi-Modal Representation Algorithm (https://arxiv.org/abs/2005.03687)
☆15Jul 6, 2023Updated 3 years ago
mispchallenge / misp2022_baseline
View on GitHub
☆33Jun 26, 2023Updated 3 years ago
oawiles / FAb-Net
View on GitHub
Pytorch code for BMVC 2018 paper
☆87Feb 26, 2020Updated 6 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
JackBurdick / ASR_DL
View on GitHub
☆13Feb 5, 2018Updated 8 years ago
xing96 / MIM-lipreading
View on GitHub
Code and model for paper <Mutual Information Maximization for Effective Lip Reading>
☆19Sep 4, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
gooofy / kaldi-adapt-lm
View on GitHub
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Jan 26, 2020Updated 6 years ago
bogireddytejareddy / face-tracker
View on GitHub
Face Tracker using RetinaFace Detector and Kalman Filter
☆44Jul 13, 2019Updated 7 years ago
OuYangMinOa / Lyto-Different-Color
View on GitHub
using opencv play Lyto Different Color
☆10Apr 28, 2020Updated 6 years ago
multitel-ai / urban-sound-classification-and-comparison
View on GitHub
Urban Sound Classification : striving towards a fair comparison
☆17Dec 11, 2020Updated 5 years ago
itsyoavshalev / End-to-End-Lip-Synchronization-with-a-Temporal-AutoEncoder
View on GitHub
☆22Mar 31, 2022Updated 4 years ago
sarang0909 / faq_chatbot
View on GitHub
COVID-19 FAQ chatbot in python along with user interfce
☆10Feb 2, 2024Updated 2 years ago
celebrity-audio-collection / videoprocess
View on GitHub
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆80Nov 9, 2019Updated 6 years ago