kooBH / PCM-A10-SSL
Sound Source Localization for PCM-A10 Microphone
☆35 Updated 2 years ago
Alternatives and similar repositories for PCM-A10-SSL
Users that are interested in PCM-A10-SSL are comparing it to the libraries listed below
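- Voice and speaker recognition technology for rescue drones ☆33 Updated 2 years ago
- Dataset for voice and speaker recognition technology for rescue drones ☆26 Updated 2 years ago
- Sound Source Localization for PCM-A10 Microphone ☆26 Updated 2 years ago
- Voice and speaker recognition technology for rescue drones ☆25 Updated 2 years ago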
- ☆29 Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆22 Updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021 ☆23 Updated 3 years ago
- ☆12 Updated 2 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022) ☆80 Updated last year
- Accurate Box Proposal Network for Scene Text Detection ☆31 Updated 3 years ago
- Implementation of Korean FastSpeech2 ☆217 Updated 2 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild ☆72 Updated last year
- Official Implementation of Visual Transformer Pooling for Lip reading ☆40 Updated 2 years ago
- ☆13 Updated 4 years ago
- 2023 Korean AI Competition ☆10 Updated last year
- The Introduction of the OLKAVS Dataset ☆31 Updated last year
- ☆99 Updated 2 years ago
- OCR DB including Korean ☆28 Updated 3 years ago
- All codes implemented on Korean voice phishing detection papers ☆16 Updated last month
- a PyTorch implementation of Lip2Wav ☆51 Updated 2 years ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video" ☆42 Updated 7 months ago
- Korean phoneme dictionary generator for training Montreal Forced Aligner (MFA) ☆13 Updated 4 years ago
- Visual Speech Recognition For Low-Resource Languages with Automatic Labels (ICASSP 2024) ☆14 Updated 4 months ago
- Emotion classification via Korean STT - Emotion recognition through Korean speech dataset (provided by AI-Hub) ☆9 Updated 3 years ago
- For Korean speech emotion detect, this model is trained by Korean dataset. There is no enough Korean dataset, so i tried to make this rep… ☆8 Updated 3 years ago
- ☆45 Updated 2 years ago
- Korean Text Data Generator for OCR tasks. ☆10 Updated 4 years ago
- Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail. ☆8 Updated 3 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild. ☆52 Updated last year
- Official implementation of Transpotter, published in BMVC 2021 ☆16 Updated 2 years ago