Telegram-Zalo / zac2022-lyric-alignment
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
☆68Dec 5, 2022Updated 3 years ago
Alternatives and similar repositories for zac2022-lyric-alignment
Users that are interested in zac2022-lyric-alignment are comparing it to the libraries listed below
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆110Dec 25, 2022Updated 3 years ago
- Vietnamese song lyric alignment framework☆68Dec 11, 2022Updated 3 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- Vietnamese self-supervised Wav2vec2 model☆61Nov 5, 2022Updated 3 years ago
- This is our project for the Mobile Development course at HCMUS.☆12Jan 13, 2023Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- Zalo AI Challenge 2022 Liveness Detection☆18Dec 30, 2022Updated 3 years ago
- A web app for both Text-based and Visual Question Answering.☆13Nov 13, 2023Updated 2 years ago
- Top 2 Solution - Quy Nhon AI Hackathon 2022☆10Sep 16, 2022Updated 3 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- Top 1 solution for the Zalo AI Challenge 2021 hum-to-song task☆110Dec 22, 2021Updated 4 years ago
- Final Project for OOP Course - University of Science, VNUHCM☆10Feb 13, 2023Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Dec 2, 2022Updated 3 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated last year
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).☆13May 25, 2021Updated 4 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆105Sep 3, 2021Updated 4 years ago
- Anyone can build their own chatbot with instruction tuning, using a 12 GB GPU (RTX 3060) and a few dozen MB of data☆114Jun 10, 2023Updated 2 years ago
- Improving Elasticsearch for semantic search using sentence embeddings☆25May 27, 2021Updated 4 years ago
- ☆16Jan 20, 2025Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆33Jun 14, 2024Updated last year
- Vietnamese translation of the 100 NLP exercises (updated to the 2020 edition), translated from 言語処理100本ノック 2020 (https://nlp100.github.io/ja)☆24Jun 8, 2020Updated 5 years ago
- ☆18Dec 18, 2022Updated 3 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- The source code for the paper CrossSinger (ASRU 2023)☆18Oct 12, 2023Updated 2 years ago
- Source code for Zalo AI 2021 submission☆142Dec 20, 2021Updated 4 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Sep 4, 2022Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022☆15Jun 18, 2022Updated 3 years ago
- Wav2vec 2.0 Self-Supervised Pretraining☆58Feb 6, 2025Updated last year
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- ☆35Aug 27, 2021Updated 4 years ago
- Text frontend for ESPnet tts recipes☆34Jun 1, 2021Updated 4 years ago