FerdinandZhong / punctuator
A small seq2seq punctuator tool based on DistilBERT
☆50Updated 2 months ago
Alternatives and similar repositories for punctuator:
Users that are interested in punctuator are comparing it to the libraries listed below
- Complementary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- 📝An easy-to-use package to restore punctuation of the text.☆112Updated last year
- The case study and multilingual performance of ICASSP submission☆20Updated 2 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆114Updated last year
- SOTA punctuation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆180Updated 5 years ago
- Token-Level Supervised Contrastive Learning for Punctuation Restoration☆29Updated 3 years ago
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆146Updated 4 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆43Updated 3 years ago
- Mirror of SRILM☆55Updated 4 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆83Updated this week
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- asr2k☆49Updated 8 months ago
- ☆55Updated last year
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer to the paper https://arxiv.org/abs/2111.10367 for more details. O…☆64Updated 11 months ago
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆59Updated 3 years ago
- Punctuation restoration in ASR text☆32Updated 5 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 3 weeks ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆67Updated 2 months ago
- RNNs for Text Normalization☆38Updated 7 years ago
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆61Updated 4 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆49Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆110Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆267Updated last month
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆108Updated 5 months ago
- Awesome TTS☆55Updated 3 years ago
- A curated list of awesome disfluency detection publications along with the released code and bibliographical information☆73Updated 3 years ago
- The Fisher and CALLHOME Spanish–English Speech Translation Corpus☆39Updated 3 years ago