DanielMengLiu/DeepLip

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DanielMengLiu/DeepLip)

DanielMengLiu / DeepLip

deep-learning based audio-visual lip bometrics

☆15

Alternatives and similar repositories for DeepLip

Users that are interested in DeepLip are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

deepaudio / deepaudio-speaker
View on GitHub
neural network based speaker embedder
☆24Jan 7, 2023Updated 3 years ago
okankop / ASDNet
View on GitHub
Audio-Visual Active Speaker Detection with PyTorch on AVA-ActiveSpeaker dataset
☆73Jan 18, 2022Updated 4 years ago
Jiang-Yidi / TS-TalkNet
View on GitHub
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61May 29, 2023Updated 3 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
vzxxbacq / PLDA
View on GitHub
This is a implementation of kaldi-plda.
☆15Jun 9, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mavceleb / mavceleb_baseline
View on GitHub
☆11Nov 5, 2025Updated 8 months ago
Tiago-Roxo / WASD
View on GitHub
☆20Mar 20, 2026Updated 4 months ago
vskadandale / vocalist
View on GitHub
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
☆73Apr 7, 2024Updated 2 years ago
Miffyli / asv-cm-reinforce
View on GitHub
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
☆13Mar 31, 2021Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
placebokkk / ctc-asr
View on GitHub
pytorch CTC implementation for ASR. Use eesen's fst decoder framework
☆10Feb 27, 2020Updated 6 years ago
One-Shot-Voice-Conversion-with-WIN / WINVC
View on GitHub
Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".
☆30Nov 13, 2021Updated 4 years ago
vvestman / pytorch-ivectors
View on GitHub
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…
☆63Oct 15, 2019Updated 6 years ago
yufan-aslp / AliMeeting
View on GitHub
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆142Jun 10, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aysebilgegunduz / ShotBoundaryDetection
View on GitHub
Detects shot boundaries from news with K-Means. Using Bhattacharya Coefficient for distance.
☆10Jun 1, 2017Updated 9 years ago
bsxfan / meta-embeddings
View on GitHub
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
zcxu-eric / AVA-AVD
View on GitHub
☆51Nov 24, 2022Updated 3 years ago
fhlt / shot_boundary_detection
View on GitHub
shot_boundary_detection
☆10Nov 26, 2019Updated 6 years ago
georgesterpu / avsr-tf1
View on GitHub
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84Jul 10, 2020Updated 6 years ago
ciodar / UniversalAttribution
View on GitHub
[ECCVW/TWYN 2024 - Best Workshop Paper] Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
☆14Mar 27, 2026Updated 3 months ago
liyongze / lstm_speaker_verification
View on GitHub
☆35Apr 8, 2019Updated 7 years ago
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
haoyanbin918 / Attention-in-Attention
View on GitHub
☆12Aug 5, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
smeetrs / deep_avsr
View on GitHub
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244Feb 15, 2024Updated 2 years ago
Gorilla-Lab-SCUT / TTAC2
View on GitHub
[TPAMI 2024] The official implementation of "Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clu…
☆13Mar 19, 2024Updated 2 years ago
fuankarion / active-speakers-context
View on GitHub
Code for the Active Speakers in Context Paper (CVPR2020)
☆58May 19, 2021Updated 5 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
View on GitHub
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Jul 28, 2022Updated 3 years ago
lawlict / ECAPA-TDNN
View on GitHub
☆106Sep 2, 2021Updated 4 years ago
yuyq96 / D-TDNN
View on GitHub
PyTorch implementation of Densely Connected Time Delay Neural Network
☆91May 4, 2023Updated 3 years ago
danhuan / photoshow
View on GitHub
本特效由H5+CSS3+JS携手打造，3D场景照片墙，环形，倒影，可拖动打造良好的视觉效果和用户体验。主要使用transfrom transition等H5技术。本特效不支持IE8及以下版本。
☆18Aug 29, 2017Updated 8 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
JusperLee / Looking-to-Listen-at-the-Cocktail-Party
View on GitHub
Executable code based on Google articles
☆166Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BUTSpeechFIT / MultiSV
View on GitHub
MultiSV: scripts for data preparation
☆31Jan 18, 2025Updated last year
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
bsxfan / PSDA
View on GitHub
Probabilistic Spherical Discriminant Analysis
☆12Oct 29, 2022Updated 3 years ago
SwinTransformer / Simple-21K-Detection
View on GitHub
☆13Jul 20, 2022Updated 4 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
facebookresearch / MMCSG
View on GitHub
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41Mar 13, 2024Updated 2 years ago
Junhua-Liao / Light-ASD
View on GitHub
The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)
☆181Mar 23, 2025Updated last year