kooBH / drone-robust-gender-classification
Voice speaker recognition technology for rescue drones
☆32, updated 2 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users who are interested in drone-robust-gender-classification are comparing it to the repositories listed below.
- Sound Source Localization for PCM-A10 Microphone ☆34, updated 2 years ago
- Dataset for voice speaker recognition technology for rescue drones ☆25, updated 2 years ago
- ☆28, updated 2 years ago
- Voice speaker recognition technology for rescue drones ☆24, updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone ☆25, updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆21, updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆22, updated 3 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild ☆75, updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80, updated 2 years ago
- Implementation of Korean FastSpeech2 ☆215, updated 2 years ago
- All codes implemented on Korean voice phishing detection papers ☆20, updated 6 months ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆25, updated last year
- ☆59, updated 2 years ago
- Accurate Box Proposal Network for Scene Text Detection ☆30, updated 3 years ago
- 2023 Korean AI Competition ☆10, updated 2 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading ☆40, updated 3 years ago
- 2019/04~2019/09 Tobigs Singing Voice Conversion ☆40, updated 5 years ago
- Open-source conversational speech synthesis platform that can express a robot's emotions and personality ☆108, updated 10 months ago
- The Introduction of the OLKAVS Dataset ☆33, updated last year
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆44, updated last year
- ☆19, updated last year
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024) ☆15, updated 8 months ago
- ☆13, updated 4 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision ☆165, updated 5 years ago
- a PyTorch implementation of Lip2Wav ☆51, updated 3 years ago
- ☆48, updated 3 years ago
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022) ☆68, updated 2 years ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA) ☆13, updated 4 years ago
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025) ☆38, updated 3 months ago
- ☆100, updated 2 years ago