heraclex12 / vietpunc
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14 Updated 3 years ago
Alternatives and similar repositories for vietpunc
Users who are interested in vietpunc are comparing it to the libraries listed below.
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆22 Updated 4 months ago
- Transforming spoken text into written text ☆31 Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated 2 years ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA ☆99 Updated last year
- Finetune Wav2vec 2.0 for Speech Recognition ☆140 Updated 8 months ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl… ☆64 Updated 10 months ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription ☆60 Updated 2 weeks ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆76 Updated 4 years ago
- Finetune the LLM part of the Spark-TTS model ☆111 Updated 7 months ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆85 Updated 11 months ago
- Fine-Tune Whisper with Transformers and PEFT ☆57 Updated last year
- ☆39 Updated 3 years ago
- ☆37 Updated 4 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0 ☆102 Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆53 Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together ☆47 Updated 3 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆55 Updated 5 months ago
- ASCEND Chinese-English code-switching dataset ☆30 Updated 3 years ago
- Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆21 Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆25 Updated 7 months ago
- ☆49 Updated 2 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment ☆68 Updated 2 years ago
- ☆20 Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆29 Updated last week
- ☆25 Updated 3 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 5 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆77 Updated 4 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆45 Updated 4 years ago