This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video Podcasts published at LREC 2022.
☆12Sep 21, 2022Updated 3 years ago
Alternatives and similar repositories for Merkel-Podcast-Corpus
Users that are interested in Merkel-Podcast-Corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- Collection of works from VIPL-AVSU☆49Updated this week
- ☆22Mar 31, 2022Updated 4 years ago
- ☆23May 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Examine the impact of perceptual and its alternatives loss on GLO☆14Nov 22, 2021Updated 4 years ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆194Nov 5, 2024Updated last year
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆36Feb 15, 2024Updated 2 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆81Feb 27, 2025Updated last year
- ☆25Nov 17, 2025Updated 5 months ago
- ☆20Mar 20, 2026Updated last month
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Nov 23, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Audio-Visual Speech Recognition☆22Jul 7, 2025Updated 9 months ago
- ☆24Feb 20, 2024Updated 2 years ago
- code to help with tsne plotting☆16May 19, 2020Updated 5 years ago
- ☆179Jul 12, 2023Updated 2 years ago
- Official Code of CVPR 2023 Paper "VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs"☆36Jun 6, 2023Updated 2 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆30Jan 28, 2025Updated last year
- Leveraging A-priori Knowledge in Predictive Business Process Monitoring☆10Jul 16, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Talking Head from Speech Audio using a Pre-trained Image Generator☆23May 7, 2024Updated last year
- ☆12Mar 19, 2025Updated last year
- Interactive visualization of the output of any binary classifier.☆14Oct 15, 2020Updated 5 years ago
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆166Sep 12, 2025Updated 7 months ago
- Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars☆96Nov 4, 2024Updated last year
- Create After Effects scripts in Python.☆13Jan 29, 2021Updated 5 years ago
- ☆429Nov 1, 2023Updated 2 years ago
- ☆12Dec 11, 2020Updated 5 years ago
- ☆13May 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 2 months ago
- ☆528Dec 26, 2023Updated 2 years ago
- Implementation of "Learning Deep Generative Models"☆12Jun 4, 2019Updated 6 years ago
- ☆33Mar 17, 2023Updated 3 years ago
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆215Aug 8, 2023Updated 2 years ago
- Support library for the MaskRCNN masks extracted on EPIC-KITCHENS-100☆14Dec 1, 2020Updated 5 years ago
- ☆11May 7, 2022Updated 3 years ago