kooBH / drone-robust-gender-classificationLinks
인명 구조용 드론을 위한 음성 화자 인지 기술
☆33Updated 2 years ago
Alternatives and similar repositories for drone-robust-gender-classification
Users that are interested in drone-robust-gender-classification are comparing it to the libraries listed below
Sorting:
- Sound Source Localization for PCM-A10 Microphone☆35Updated 2 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋☆26Updated 2 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆25Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Updated 3 years ago
- Sound Source Localization for PCM-A10 Microphone☆26Updated 2 years ago
- ☆29Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021☆23Updated 3 years ago
- Implementation of Korean FastSpeech2☆217Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated last year
- Look Who’s Talking: Active Speaker Detection in the Wild☆72Updated last year
- ☆12Updated 2 years ago
- 2023 한국어 AI 경진대회☆10Updated last year
- Accurate Box Proposal Network for Scene Text Detection☆31Updated 3 years ago
- All codes implemented on Korean voice phishing detection papers☆16Updated 2 months ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- The Introduction of the OLKAVS Dataset☆31Updated last year
- ☆99Updated 2 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Updated last year
- a PyTorch implementation of Lip2Wav☆51Updated 2 years ago
- Multi-speaker & Multi-style TTS☆29Updated last year
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆161Updated 5 years ago
- ☆86Updated 2 years ago
- OCR DB including Korean☆28Updated 3 years ago
- Korean Text Data Generator for OCR tasks.☆10Updated 4 years ago
- 3rd Grand Challenge track 3 DB developed by GIST☆36Updated 4 years ago
- Audio Only Speech Enhancement using Unet☆9Updated 4 years ago
- 오디오 전처리 작업을 위한 연습☆25Updated 6 years ago
- Official Implementation of Visual Transformer Pooling for Lip reading☆40Updated 3 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024)☆13Updated 4 months ago
- ☆57Updated 2 years ago