kooBH / drone-robust-gender-classification
Voice-based speaker recognition technology for rescue drones
☆33 Updated 2 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users that are interested in drone-robust-gender-classification are comparing it to the libraries listed below
- Sound Source Localization for PCM-A10 Microphone ☆35 Updated 2 years ago
- Dataset for voice-based speaker recognition technology for rescue drones ☆26 Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆22 Updated 3 years ago
- ☆29 Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone ☆26 Updated 2 years ago
- Voice-based speaker recognition technology for rescue drones ☆25 Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆23 Updated 3 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024) ☆13 Updated 5 months ago
- Look Who’s Talking: Active Speaker Detection in the Wild ☆73 Updated 2 years ago
- ☆12 Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80 Updated 2 years ago
- The Introduction of the OLKAVS Dataset ☆31 Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆25 Updated last year
- The unofficial implementation of the paper "Objects that Sound" from ECCV 2018. ☆31 Updated last year
- ☆58 Updated 2 years ago
- Accurate Box Proposal Network for Scene Text Detection ☆31 Updated 3 years ago
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025) ☆30 Updated 2 months ago
- ☆13 Updated 4 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading ☆40 Updated 3 years ago
- Implementation of Korean FastSpeech2 ☆216 Updated 2 years ago
- A PyTorch implementation of Lip2Wav ☆52 Updated 2 years ago
- All code implemented for Korean voice phishing detection papers ☆16 Updated 2 months ago
- ☆18 Updated last year
- 2023 Korean AI Competition ☆10 Updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆42 Updated 8 months ago
- ☆99 Updated 2 years ago
- Multi-speaker & Multi-style TTS ☆29 Updated last year
- Official implementation of Transpotter, published in BMVC 2021 ☆16 Updated 3 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆162 Updated 5 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024) ☆68 Updated 6 months ago