dee-ex / aicovidvn115mLinks
Giải pháp của nhóm "đi thi", đạt được Hạng 3 vòng Về đích với AUC 0.92 trong cuộc thi AICovidVN115m
☆13Updated 4 years ago
Alternatives and similar repositories for aicovidvn115m
Users that are interested in aicovidvn115m are comparing it to the libraries listed below
Sorting:
- 1st Place Public Leaderboard Solution for ERC2019☆70Updated 5 years ago
- Zalo AI Challenge 2020 - Top 2 @ Voice Verification☆15Updated 3 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆103Updated 4 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated 2 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Updated 6 years ago
- Solution by Nhi Vo for AICovidVN 115M Challenge: Covid Cough Detection Challenge☆10Updated 4 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Updated 2 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆142Updated 9 months ago
- A synthesized dataset for Vietnamese TTS task☆64Updated 3 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 6 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 5 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 6 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 5 years ago
- A fine-tuned Large Language Model (LLM) for the Vietnamese language based on the Llama 2 model.☆17Updated 2 years ago
- Collection of research papers on cough classification☆40Updated 5 years ago
- Some tutorials used for ASR class☆31Updated 4 years ago
- Vietnamese song lyric alignment framework☆68Updated 2 years ago
- An advance kaldi wrapper for Pyhton☆38Updated 4 years ago
- Audio data augmentation examples☆34Updated 7 years ago
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Updated 6 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Updated 7 years ago