kooBH / PCM-A10-SSLLinks
Sound Source Localization for PCM-A10 Microphone
☆34Updated 3 years ago
Alternatives and similar repositories for PCM-A10-SSL
Users that are interested in PCM-A10-SSL are comparing it to the libraries listed below
Sorting:
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆32Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술 데이터셋☆25Updated 3 years ago
- Sound Source Localization for PCM-A10 Microphone☆25Updated 3 years ago
- ☆28Updated 3 years ago
- 인명 구조용 드론을 위한 음성 화자 인지 기술☆24Updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021☆21Updated 4 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Updated 4 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆79Updated 2 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Updated 2 years ago
- Implementation of Korean FastSpeech2☆215Updated 3 years ago
- Accurate Box Proposal Network for Scene Text Detection☆30Updated 3 years ago
- ☆13Updated 4 years ago
- Multi-speaker & Multi-style TTS☆29Updated last year
- 2023 한국어 AI 경진대회☆10Updated 2 years ago
- ☆102Updated 2 years ago
- ☆19Updated last year
- The Introduction of the OLKAVS Dataset☆37Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- Simple Tensorflow implementation of "Toward Spatially Unbiased Generative Models" (ICCV 2021)☆15Updated 4 years ago
- ☆87Updated 3 years ago
- Korean Text Data Generator for OCR tasks.☆10Updated 5 years ago
- ☆49Updated 3 years ago
- Updated folk of g2pk☆13Updated 2 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆43Updated last year
- Official implementation of "ViSAGe: Video-to-Spatial AUdio Generation" (ICLR 2025)☆41Updated 5 months ago
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.☆29Updated 2 years ago
- The unofficial implementation of paper, "Objects that Sound", from ECCV 2018.☆31Updated 2 years ago
- 로봇의 감정 및 개성을 표현할 수 있는 대화형 음성합성 오픈소스 플랫폼☆108Updated last year
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆46Updated 2 years ago
- OCR DB including Korean☆27Updated 4 years ago