This dataset is presented in the paper Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video Podcasts published at LREC 2022.
☆12Sep 21, 2022Updated 3 years ago
Alternatives and similar repositories for Merkel-Podcast-Corpus
Users that are interested in Merkel-Podcast-Corpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- ☆15Dec 11, 2021Updated 4 years ago
- The implementation of g2pL with a new open dataset.☆16May 14, 2023Updated 3 years ago
- ☆22Mar 31, 2022Updated 4 years ago
- Examine the impact of perceptual and its alternatives loss on GLO☆15Nov 22, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Generates training data for "Deferred Neural Rendering: Image Synthesis using Neural Textures" using OpenGL☆19Jan 31, 2020Updated 6 years ago
- [INTERSPEECH'24] Official repository for "MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Datase…☆194Nov 5, 2024Updated last year
- [NeurIPS 2023] AV-NeRF: Learning Neural Fields for Real-World Audio-Visual Scene Synthesis☆36Feb 15, 2024Updated 2 years ago
- ☆29Jun 15, 2022Updated 3 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆81Feb 27, 2025Updated last year
- ☆26Nov 17, 2025Updated 6 months ago
- Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".☆17May 21, 2022Updated 4 years ago
- "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Sep 9, 2019Updated 6 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- VoxCeleb plugin for pyannote.database☆30Aug 4, 2021Updated 4 years ago
- Official implementation of 'Out-of-domain GAN inversion via Invertibility Decomposition for Photo-Realistic Human Face Manipulation'☆23Feb 29, 2024Updated 2 years ago
- Audio-Visual Speech Recognition☆24Jul 7, 2025Updated 10 months ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Nov 8, 2021Updated 4 years ago
- code to help with tsne plotting☆16May 19, 2020Updated 6 years ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- Retinaface get 80.99% in widerface hard val using mobilenet0.25.☆25May 14, 2020Updated 6 years ago
- ☆181Jul 12, 2023Updated 2 years ago
- ☆24Mar 30, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official Code of CVPR 2023 Paper "VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs"☆36Jun 6, 2023Updated 2 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- Talking Head from Speech Audio using a Pre-trained Image Generator☆22May 7, 2024Updated 2 years ago
- ☆12Mar 19, 2025Updated last year
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆167Sep 12, 2025Updated 8 months ago
- Using Claude Opus to reverse engineer code from MegaPortraits: One-shot Megapixel Neural Head Avatars☆96Nov 4, 2024Updated last year
- Create After Effects scripts in Python.☆13Jan 29, 2021Updated 5 years ago
- ☆429Nov 1, 2023Updated 2 years ago
- ☆12Dec 11, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13May 11, 2024Updated 2 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 2 months ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Dec 30, 2019Updated 6 years ago
- ☆527Dec 26, 2023Updated 2 years ago
- Implementation of "Learning Deep Generative Models"☆12Jun 4, 2019Updated 6 years ago
- ☆33Mar 17, 2023Updated 3 years ago
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆215Aug 8, 2023Updated 2 years ago