PAGADALA-USHA / TELUGU-DATASET
We created the Telugu dataset to address the challenge of building Automatic Speech Recognition (ASR) systems for Indian languages, considering the constraints of limited resources. The dataset includes recordings of common Telugu words spoken by native speakers from Andhra Pradesh and Telangana.
☆20Updated last year
Alternatives and similar repositories for TELUGU-DATASET:
Users that are interested in TELUGU-DATASET are comparing it to the libraries listed below
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆59Updated last year
- ☆39Updated last year
- ☆40Updated last year
- StoryDiffusion serverless worker☆16Updated 11 months ago
- ☆32Updated last year
- ☆43Updated last year
- ☆28Updated 10 months ago
- AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation☆22Updated last year
- A Pixar-inspired dreambooth diffusion model.☆10Updated last year
- ☆14Updated last year
- (CVPR 2023)SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆29Updated 11 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆51Updated 10 months ago
- ☆20Updated last year
- ☆79Updated last year
- Cog wrapper for PASD Magnify☆16Updated last year
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆38Updated last year
- ☆20Updated last year
- ☆18Updated last year
- ☆53Updated 7 months ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation