Hyejin-Koo / EE4178Links

Introduction to Artificial Intelligence(Deep Learning)

☆7

Alternatives and similar repositories for EE4178

Users that are interested in EE4178 are comparing it to the libraries listed below

Sorting:

kooBH / drone-robust-gender-classification
인명 구조용 드론을 위한 음성 화자 인지 기술
☆33Updated 2 years ago
kooBH / PCM-A10-SSL
Sound Source Localization for PCM-A10 Microphone
☆35Updated 2 years ago
clovaai / lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
☆72Updated last year
kuai-lab / sound-guided-semantic-image-manipulation
Sound-guided Semantic Image Manipulation - Official Pytorch Code (CVPR 2022)
☆80Updated last year
cloor / Speech-Emotion-Recognition-ROS
For Korean speech emotion detect, this model is trained by Korean dataset. There is no enough Korean dataset, so i tried to make this rep…
☆9Updated 3 years ago
msh9184 / contrastive-equilibrium-learning
☆21Updated 4 years ago
treblenalto / korean-speech-emotion-recognition
한국어 STT를 통한 감정 분류 - Emotion recognition through Korean speech dataset (provided by AI-Hub)
☆9Updated 3 years ago
skaws2003 / pytorch-mfcc
A pytorch implementation of MFCC.
☆33Updated 3 years ago
sooftware / kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆621Updated 2 years ago
dacon-ai / K-fashion-baseline
☆13Updated 4 years ago
clovaai / ClovaCall
ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
☆221Updated 3 years ago
selfcontrol7 / Korean_Voice_Phishing_Detection
All codes implemented on Korean voice phishing detection papers
☆16Updated last month
shleee47 / Sound-Source-Localization
Sound Source Localization for AI Grand Challenge 2021
☆22Updated 3 years ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆79Updated 3 years ago
Sato-Kunihiko / audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
☆218Updated last year
seongmin-kye / meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆74Updated 4 years ago
JoungheeKim / K-wav2vec
☆86Updated 2 years ago
ms-dot-k / Visual-Audio-Memory
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆20Updated 3 years ago
Donghwa-KIM / audiotext-transformer
cross-modal model between audio(MFCC) and text(KoBERT)
☆12Updated 4 years ago
fd873630 / RNN-Transducer
RNN-Transducer for korean
☆43Updated 4 years ago
Derpimort / VGGVox-PyTorch
Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.
☆25Updated 4 years ago
afourast / avobjects
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆113Updated 4 years ago
sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
shleee47 / mpWAV-Sound-Source-Localization
Sound Source Localization for AI Grand Challenge 2021
☆23Updated 3 years ago
AiTeRLab-GIST / GC_track4_violence_detection_GIST
Grand Challenge 4 track 2 sourcecode developed by GIST
☆13Updated 4 years ago
JackSyu / Discriminative-Multi-modality-Speech-Recognition
TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"
☆26Updated 3 years ago
shvdiwnkozbw / Multi-Source-Sound-Localization
This repo aims to perform sound localization in complex audiovisual scenes, where there multiple objects making sounds.
☆85Updated 3 years ago
fd873630 / deep_speech_2_korean
한국어 음성 인식을 위한 deep speech 2
☆27Updated 5 years ago
gogyzzz / localatt_emorecog
A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'
☆41Updated 6 years ago
rhgao / co-separation
Co-Separating Sounds of Visual Objects (ICCV 2019)
☆96Updated last year