google-research-datasets / uninum
A database of number names for 186 languages, locales, and scripts
☆67Updated 2 years ago
Alternatives and similar repositories for uninum:
Users that are interested in uninum are comparing it to the libraries listed below
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 5 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 10 years ago
- ☆21Updated 5 years ago
- Corpus preprocessing☆95Updated 11 months ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 3 years ago
- bilingual dictionary extractor from parallel corpora☆22Updated 10 years ago
- Neural macine translation soft alignment visualisations for web and command line☆72Updated 3 years ago
- We release a dataset based on Wikipedia sentences and the corresponding translations in 6 different languages along with the scores (scal…☆81Updated 3 years ago
- LSTM Language Model with Subword Units Input Representations☆42Updated 3 years ago
- ☆49Updated 3 years ago
- Assessing syntactic abilities of BERT☆39Updated 5 years ago
- ☆42Updated 6 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- Multi-lingual Text Processing☆96Updated 6 years ago
- English text corrector by using deep neural networks in Pytorch☆47Updated 7 years ago
- Microsoft Speech Language Translation (MSLT) Corpus☆19Updated 7 years ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Task☆92Updated 5 years ago
- Code for the paper "Multi-Task Learning for Domain-General Spoken Disfluency Detection in Dialogue Systems" (Igor Shalyminov, Arash Eshgh…☆24Updated 2 years ago
- Efficient Markov Chain word alignment☆53Updated 3 years ago
- RNNs for Text Normalization☆38Updated 7 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- ☆12Updated 9 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…☆101Updated last year
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆221Updated 2 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆72Updated 5 years ago
- XenC: open-source data selection tool for NLP☆63Updated 8 years ago
- Summary of Responses to Questionnaire on Annotation Platform https://forms.gle/iZk8kehkjAWmB8xe9☆59Updated 4 years ago