kooBH / drone-robust-gender-classification
Voice-based speaker recognition technology for life-rescue drones
☆33 Updated 2 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users who are interested in drone-robust-gender-classification are comparing it to the repositories listed below
- Sound Source Localization for PCM-A10 Microphone ☆35 Updated 2 years ago
- Dataset for voice-based speaker recognition technology for life-rescue drones ☆26 Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone ☆26 Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆22 Updated 3 years ago
- ☆29 Updated 2 years ago
- Voice-based speaker recognition technology for life-rescue drones ☆25 Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆23 Updated 3 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild ☆74 Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80 Updated 2 years ago
- All code implemented for Korean voice phishing detection papers ☆16 Updated 4 months ago
- Implementation of Korean FastSpeech2 ☆216 Updated 2 years ago
- 2023 Korean AI Competition ☆10 Updated last year
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024) ☆13 Updated 6 months ago
- An unofficial implementation of the ECCV 2018 paper "Objects that Sound" ☆31 Updated last year
- 3rd Grand Challenge track 3 DB developed by GIST ☆36 Updated 4 years ago
- Multi-speaker & Multi-style TTS ☆29 Updated last year
- ☆18 Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆25 Updated last year
- A PyTorch implementation of Lip2Wav ☆51 Updated 3 years ago
- ☆100 Updated 2 years ago
- The Introduction of the OLKAVS Dataset ☆32 Updated last year
- ☆13 Updated 4 years ago
- Accurate Box Proposal Network for Scene Text Detection ☆31 Updated 3 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆43 Updated 9 months ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. ☆28 Updated 2 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading ☆40 Updated 3 years ago
- ☆59 Updated 2 years ago
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025) ☆31 Updated 3 weeks ago
- Code for the Active Speakers in Context Paper (CVPR2020) ☆55 Updated 4 years ago
- ☆87 Updated 2 years ago