FerdinandZhong / punctuator
A small seq2seq punctuator tool based on DistilBERT
☆50Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for punctuator
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆113Updated last year
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆48Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆74Updated last year
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆179Updated 5 years ago
- The case study and multilingfual performance of ICASSP submission☆19Updated 2 years ago
- Language-agnostic BERT Sentence Embedding (LaBSE)☆140Updated 4 years ago
- Punctuation restoration in ASR text☆33Updated 5 years ago
- Mirror of SRILM☆54Updated 4 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆251Updated last month
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- SimulEval: A General Evaluation Toolkit for Simultaneous Translation☆102Updated 2 months ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆204Updated 3 months ago
- ☆54Updated last year
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆110Updated 9 months ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- ☆54Updated this week
- A python package for deep multilingual punctuation prediction.☆99Updated 3 months ago
- ☆33Updated 3 years ago
- A unified versatile interface for dialogue datasets☆16Updated 11 months ago
- Various speech datasets made available to the public☆99Updated 2 months ago
- An official implementation of "BPE-Dropout: Simple and Effective Subword Regularization" algorithm.☆48Updated 3 years ago
- ☆101Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆81Updated last week
- ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET☆58Updated 2 years ago
- This project attempts to maintain the SOTA performance in machine translation☆108Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- Awesome TTS☆54Updated 3 years ago
- Fast + Non-Autoregressive Grammatical Error Correction using BERT. Code and Pre-trained models for paper "Parallel Iterative Edit Models …☆228Updated last year
- TEXTOIR: An Integrated and Visualized Platform for Text Open Intent Recognition (ACL 2021)☆47Updated 2 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year