Python tool for normilizing text and text canonicalization (DISCONTINUED)
☆41Sep 3, 2013Updated 12 years ago
Alternatives and similar repositories for text-normalization
Users that are interested in text-normalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.☆77Jul 9, 2021Updated 4 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- ☆213Jun 16, 2018Updated 7 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- A calculator with equations and variables☆13Mar 23, 2016Updated 10 years ago
- Free noise reduction of speech signals☆12Jul 26, 2016Updated 9 years ago
- ☆19May 11, 2024Updated last year
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources☆17Apr 16, 2020Updated 5 years ago
- Lightweight ngram random text generator☆12Jul 11, 2014Updated 11 years ago
- REST service to call the Festival text to speech application☆24Jan 16, 2019Updated 7 years ago
- Data preparation code for building Kaldi ASR system☆14Mar 18, 2017Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…☆63May 13, 2020Updated 5 years ago
- A tool for text normalisation via character-level machine translation☆13Jun 12, 2020Updated 5 years ago
- ☆19Jun 4, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks.☆39Sep 30, 2020Updated 5 years ago
- In order to demonstrate any signal accurately it is important to know the noise containt in the signal. Thus, a fundamental measure is th…☆13May 10, 2021Updated 4 years ago
- Api.ai English Speech Recognition (ASR) Model for Kaldi☆35Dec 27, 2020Updated 5 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆43Jan 16, 2020Updated 6 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Analyze XML extracted from PDFs (e.g. from TET or PDFMiner)☆20Jan 11, 2018Updated 8 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆31Feb 2, 2026Updated 2 months ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- This is the repo that contains the databases for the Europython Challenge 2016 proposed by Plethora☆10Jul 12, 2016Updated 9 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 4 years ago
- Language resources for the Snips Natural Language Understanding (NLU)☆34Sep 10, 2019Updated 6 years ago
- c++ Kaldi IO lib (static and dynamic).☆25Nov 26, 2018Updated 7 years ago
- Variational autoencoder implementation in tensorflow following the classic paper by Kingma and Welling.☆13Jul 12, 2017Updated 8 years ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation☆37May 14, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- SOTA punctation restoration (for e.g. automatic speech recognition) deep learning model based on BERT pre-trained model☆182May 17, 2019Updated 6 years ago
- Sberbank Data Science Contest 2017. Задача B: построение вопрос-ответной системы.☆11Nov 7, 2018Updated 7 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Dereverberation of Speech Signals Using Weighted Prediction Error☆23May 17, 2019Updated 6 years ago
- Generalized Language Modeling toolkit☆52Jun 21, 2022Updated 3 years ago
- repository for mental health discussions☆16Apr 29, 2017Updated 8 years ago
- Korean text normalization and language preparation package for LM in Kaldi-based ASR system☆63Apr 23, 2020Updated 5 years ago