heraclex12 / vietpunc
Vietnamese Punctuation Prediction using Pretrained Language Models
☆13Updated 2 years ago
Alternatives and similar repositories for vietpunc:
Users that are interested in vietpunc are comparing it to the libraries listed below
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆21Updated 8 months ago
- Transforms spoken text into written text☆30Updated 10 months ago
- ☆12Updated last month
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆19Updated last year
- Vi_G2P or ViG2P: a G2P package for Vietnamese, based on vPhon and phonology knowledge, that converts raw text (Graphoneme) to IPA☆80Updated 9 months ago
- 56 languages, 1 model: multilingual ASR☆25Updated 3 years ago
- ☆25Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆22Updated 3 years ago
- Explores different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT)☆46Updated last year
- Incorporating a KenLM language model with the HuggingFace implementation of the Wav2Vec2CTC model using beam search decoding☆75Updated 3 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- This repository contains all the code necessary for running the multilingual DistilWhisper from the Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- ☆44Updated 7 months ago
- VietTTS: An Open-Source Vietnamese Text to Speech☆38Updated 3 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 6 months ago
- ☆38Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- ☆56Updated 2 years ago
- ASCEND Chinese-English code-switching dataset☆24Updated 2 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆40Updated last year
- Diarization Metric in One: currently supports DER, JER, CDER, SER, and BER☆9Updated 2 years ago
- ☆36Updated 6 months ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- ☆15Updated 2 years ago
- ☆45Updated 2 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 5 months ago