PAGADALA-USHA / TELUGU-DATASETLinks
We created the Telugu dataset to address the challenge of building Automatic Speech Recognition (ASR) systems for Indian languages, considering the constraints of limited resources. The dataset includes recordings of common Telugu words spoken by native speakers from Andhra Pradesh and Telangana.
☆21Updated last year
Alternatives and similar repositories for TELUGU-DATASET
Users that are interested in TELUGU-DATASET are comparing it to the libraries listed below
Sorting:
- An AI focused photo manipulation tool based on Gradio☆185Updated 3 weeks ago
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆42Updated 10 months ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆186Updated last year
- Our cutting-edge application harnesses the power of deep learning and computer vision to analyze skin images and predict potential diseas…☆12Updated last year
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆269Updated last month
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆205Updated last year
- ☆43Updated last year
- ⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡☆60Updated this week
- ☆55Updated 10 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Updated 10 months ago
- [inactive] MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation☆13Updated last year
- ☆27Updated 8 months ago
- ☆28Updated last year
- ☆24Updated 3 weeks ago
- Official repo for DiffArtist (ACM MM 2025)☆121Updated 2 weeks ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆75Updated last month
- The best OSS video generation models☆134Updated 9 months ago
- Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevan…☆21Updated last year
- HunyuanVideo GP: Large Video Generation Model - GPU Poor version☆431Updated last month
- ReSwapper aims to reproduce the implementation of inswapper. This repository provides code for training, inference, and includes pretrain…☆191Updated last month
- ☆30Updated 7 months ago
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆125Updated last year
- Create your own personal avatar for Zoom or Discord chats or even live streaming...☆11Updated 9 months ago
- ☆16Updated last year
- ☆79Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆35Updated 9 months ago
- ☆75Updated last year
- StoryDiffusion serverless worker☆17Updated last year
- ☆24Updated last year
- ☆16Updated 6 months ago