kooBH / drone-robust-gender-classificationLinks
인명 구조용 드론을 위한 음성 화자 인지 기술
☆32Updated 2 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users that are interested in drone-robust-gender-classification are comparing it to the libraries listed below
Sorting:
- Sound Source Localization for PCM-A10 Microphone☆34Updated 2 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋☆25Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆24Updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 3 years ago
- ☆28Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone☆25Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Updated 3 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Updated 2 years ago
- Implementation of Korean FastSpeech2☆215Updated 2 years ago
- ☆11Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆79Updated 2 years ago
- Accurate Box Proposal Network for Scene Text Detection☆30Updated 3 years ago
- The Introduction of the OLKAVS Dataset☆33Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- 2023 한국어 AI 경진대회☆10Updated 2 years ago
- OCR DB including Korean☆27Updated 4 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆165Updated 5 years ago
- 3rd Grand Challenge track 3 DB developed by GIST☆35Updated 4 years ago
- ☆100Updated 2 years ago
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆39Updated 4 months ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Updated last year
- ☆13Updated 4 years ago
- RNN-Transducer for korean☆45Updated 5 years ago
- 로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼☆108Updated 11 months ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆57Updated 2 years ago
- Code for the Active Speakers in Context Paper (CVPR2020)☆56Updated 4 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆58Updated last year
- ☆87Updated 3 years ago
- ☆59Updated 2 years ago
- Multi-speaker & Multi-style TTS☆29Updated last year