Yoctol / text-normalizer

Normalize text string

☆12

Alternatives and similar repositories for text-normalizer

Users who are interested in text-normalizer are comparing it to the libraries listed below.

voidism / pywordseg
Open Source State-of-the-art Chinese Word Segmentation System with BiLSTM and ELMo. https://arxiv.org/abs/1901.05816
☆45 Updated 4 years ago
MiuLab / Lattice-ELMo
Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"
☆18 Updated 2 years ago
iamyuanchung / TOEFL-QA
A question answering dataset for machine comprehension of spoken content
☆78 Updated 7 years ago
dimalik / ats
☆13 Updated 8 years ago
MiuLab / HNLG
Natural Language Generation by Hierarchical Decoding with Linguistic Patterns (NAACL-HLT 2018), Investigating Linguistic Pattern Ordering…
☆32 Updated 6 years ago
Chia-Hsuan-Lee / ODSQA
ODSQA: Open-Domain Spoken Question Answering Dataset
☆63 Updated 3 years ago
MiuLab / MUSE
Modularizing Unsupervised Sense Embedding
☆29 Updated 7 years ago
sonos / spoken-language-understanding-research-datasets
☆49 Updated 3 years ago
johncf / text2phones
Attentional Neural Network that translates text to phones.
☆11 Updated 7 years ago
aasish / userIntentDataset
☆14 Updated 8 years ago
voidful / TFkit
🤖📇 Handling multiple NLP tasks in one pipeline
☆56 Updated last month
shawnwun / woz
The wizard of oz code used for collecting goal-oriented dialogue systems
☆13 Updated 7 years ago
yvchen / ContextualSLU
☆47 Updated 7 years ago
Tianxu-Jia / LM-GEC
Grammatical Error Correction Based on Language Model (BERT, GPT-2), and Seq2Seq
☆18 Updated 5 years ago
neulab / extreme-adaptation-for-personalized-translation
Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"
☆42 Updated 3 years ago
khiajohnson / SpiCE-Corpus
An open-access corpus of conversational bilingual speech in Cantonese and English
☆40 Updated 3 years ago
lukasgarbas / can-we-tune-together
Combining encoder-based language models
☆11 Updated 3 years ago
gcunhase / StackedDeBERT
Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)
☆32 Updated 2 years ago
claravania / subword-lstm-lm
LSTM Language Model with Subword Units Input Representations
☆42 Updated 4 years ago
dbd-challenge / dbdc3
☆10 Updated 6 years ago
TurkuNLP / wikibert
BERT models for many languages created from Wikipedia texts
☆33 Updated 5 years ago
90217 / joint-intent-classification-and-slot-filling-based-on-BERT
BERT for joint intent classification and slot filling
☆39 Updated 5 years ago
jackalhan / qa_datasets_converter
Format converter from one type of qa task datasets to another type
☆39 Updated 6 years ago
lverwimp / tf-lm
Language modeling scripts based on TensorFlow
☆58 Updated 5 years ago
simonjisu / pytorch_tutorials
some tutorials for blog: simonjisu.github.io
☆23 Updated 4 years ago
mayhewsw / pytorch-truecaser
A simple neural truecaser written in pytorch and allennlp.
☆33 Updated last year
dsindex / iclassifier
reference pytorch code for intent classification
☆45 Updated 9 months ago
microsoft / Distilled-Sentence-Embedding
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation
☆32 Updated 2 years ago
voidful / awesome-question-answering-dataset
A list of awesome machine question answering datasets - 機器問答數據集
☆15 Updated 5 years ago
Kaleidophon / token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …
☆51 Updated 8 months ago