heraclex12 / vietpuncLinks
Vietnamese Punctuation Prediction using Pretrained Language Models
☆13Updated 3 years ago
Alternatives and similar repositories for vietpunc
Users that are interested in vietpunc are comparing it to the libraries listed below
Sorting:
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)☆21Updated 10 months ago
- Transformation spoken text to written text☆30Updated last year
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcription☆41Updated 3 weeks ago
- Vi_G2P or ViG2P: G2P package for Vietnamese: based on vPhon and phonology knowledge to convert Raw text - Graphoneme to IPA☆87Updated 11 months ago
- ☆12Updated 4 months ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆39Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- finetune llm part for spark-tts model☆76Updated 2 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆77Updated 6 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated last year
- Diarization Metric in One: current support DER, JER, CDER, SER, and BER☆10Updated 2 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆20Updated last year
- ☆15Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆51Updated last week
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆16Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 6 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆24Updated last year
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆98Updated 3 years ago
- ☆25Updated 2 years ago
- English conversation corpus for conversational TTS.☆21Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- ☆17Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- ☆16Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆39Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- ☆34Updated 3 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆21Updated last year