dee-ex / aicovidvn115mLinks
Giải pháp của nhóm "đi thi", đạt được Hạng 3 vòng Về đích với AUC 0.92 trong cuộc thi AICovidVN115m
☆13Updated 4 years ago
Alternatives and similar repositories for aicovidvn115m
Users that are interested in aicovidvn115m are comparing it to the libraries listed below
Sorting:
- Solution by Nhi Vo for AICovidVN 115M Challenge: Covid Cough Detection Challenge☆10Updated 4 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆70Updated 5 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Updated 6 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Updated 3 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆104Updated 4 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- A synthesized dataset for Vietnamese TTS task☆65Updated 3 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- Collection of research papers on cough classification☆40Updated 5 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆17Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆68Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- A collection of Audio and Speech pre-trained models.☆193Updated 5 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆145Updated 10 months ago
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Repository of code for Speech emotion recognition using voiced speech and attention model, submitted to ICSigSys 2019☆13Updated 5 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 6 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆90Updated last year
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆52Updated 6 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated last year
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago