kooBH / PCM-A10-SSL
Sound Source Localization for PCM-A10 Microphone
☆35Updated 2 years ago
Alternatives and similar repositories for PCM-A10-SSL
Users who are interested in PCM-A10-SSL are comparing it to the libraries listed below.
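- Voice and speaker recognition technology for rescue drones☆33Updated 2 years ago
- Dataset for voice and speaker recognition technology for rescue drones☆26Updated 2 years ago
- Voice and speaker recognition technology for rescue drones☆25Updated 2 years ago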
- Sound Source Localization for PCM-A10 Microphone☆26Updated 2 years ago
- ☆29Updated 2 years ago
- Sound Source Localization for AI Grand Challenge 2021☆22Updated 3 years ago
- Sound Source Localization for AI Grand Challenge 2021☆23Updated 3 years ago
- Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)☆80Updated last year
- Implementation of Korean FastSpeech2☆217Updated 2 years ago
- ☆12Updated 2 years ago
- Accurate Box Proposal Network for Scene Text Detection☆31Updated 3 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆72Updated last year
- 2023 Korean AI Competition☆10Updated last year
- All code implemented for Korean voice phishing detection papers☆16Updated 2 months ago
- Korean Text Data Generator for OCR tasks.☆10Updated 4 years ago
- OCR DB including Korean☆28Updated 3 years ago
- ☆99Updated 2 years ago
- The Introduction of the OLKAVS Dataset☆31Updated last year
- ☆86Updated 2 years ago
- Practice for audio preprocessing tasks☆25Updated 6 years ago
- Multi-speaker & Multi-style TTS☆29Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- Official implementation of "ViSAGe: Video-to-Spatial Audio Generation" (ICLR 2025)☆27Updated 2 months ago
- Official PyTorch implementation of ReWaS (AAAI'25) "Read, Watch and Scream! Sound Generation from Text and Video"☆42Updated 8 months ago
- An unofficial implementation of the paper "Objects that Sound" (ECCV 2018).☆31Updated last year
- A PyTorch implementation of Lip2Wav☆51Updated 2 years ago
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)☆222Updated 3 years ago
- Disentangled Speech Embeddings using Cross-Modal Self-Supervision☆161Updated 5 years ago
- Korean Streaming ASR(with Denoiser and Conformer CTC)☆25Updated last year
- For Korean speech emotion detection; this model is trained on a Korean dataset. There are not enough Korean datasets, so I tried to make this rep…☆8Updated 3 years ago