Donghwa-KIM / audiotext-transformer
cross-modal model between audio(MFCC) and text(KoBERT)
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for audiotext-transformer
- ☆82Updated last year
- Multimodal Transformer for Korean Sentiment Analysis with Audio and Text Features☆28Updated 3 years ago
- Review of papers I read☆14Updated 3 years ago
- A PyTorch Implementation of "Attention Is All You Need"☆38Updated 3 years ago
- A short tutorial on Keras for the co-utilization of audio and text data (multi-modal analysis)☆17Updated 2 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆23Updated 4 years ago
- RNN-Transducer for korean☆39Updated 4 years ago
- This is project to analyze korquad 2.0☆24Updated 2 years ago
- ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)☆218Updated 2 years ago
- 오디오 전처리 작업을 위한 연습☆25Updated 5 years ago
- ☆11Updated 4 years ago
- ☆36Updated 3 years ago
- Pre-processing KsponSpeech corpus (Korean Speech dataset) provided by AI Hub.☆89Updated 2 years ago
- A Fairseq implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆11Updated 3 years ago
- AI grand challenge 2020 Repo (Speech Recognition Track)☆22Updated 2 years ago
- 한국어 음성인식 튜토리얼☆64Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- Repository for speech paper reading☆32Updated 3 years ago
- Context Modeling with Speaker's Pre-trained Memory Tracking for Emotion Recognition in Conversation (NAACL 2022)☆62Updated last year
- Korean Speech to English Translation Corpus☆42Updated 3 years ago
- PyTorch v1.2에서 생긴 Transformer API 를 이용한 간단한 Chitchat 챗봇☆49Updated 5 years ago
- ☆39Updated 5 years ago
- Transformer Implementation using PyTorch for Neural Machine Translation (Korean to English)☆69Updated 3 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 2 years ago
- Dimensional Emotion Detection from Categorical Emotion Annotation☆45Updated 3 years ago
- Data Augmentation Toolkit for Korean text.☆51Updated 3 years ago
- ☆12Updated 3 years ago
- PyTorch implementation of the RNN-based sequence-to-sequence architecture.☆22Updated 3 years ago