dee-ex / aicovidvn115mLinks
Giải pháp của nhóm "đi thi", đạt được Hạng 3 vòng Về đích với AUC 0.92 trong cuộc thi AICovidVN115m
☆13Updated 4 years ago
Alternatives and similar repositories for aicovidvn115m
Users that are interested in aicovidvn115m are comparing it to the libraries listed below
Sorting:
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Updated 3 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Updated 6 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆102Updated 4 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆70Updated 5 years ago
- Solution by Nhi Vo for AICovidVN 115M Challenge: Covid Cough Detection Challenge☆10Updated 4 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 6 years ago
- ☆12Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆140Updated 8 months ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 5 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- Freesound Audio Tagging 2019☆95Updated 6 years ago
- A synthesized dataset for Vietnamese TTS task☆64Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 6 years ago
- Vietnamese song lyric alignment framework☆68Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆16Updated 2 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆48Updated 4 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated last year
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 7 years ago