huntzhan / text-cleaner
simple text preprocessing tool
☆18Updated 7 years ago
Alternatives and similar repositories for text-cleaner:
Users that are interested in text-cleaner are comparing it to the libraries listed below
- LSTM Language Model with Subword Units Input Representations☆42Updated 3 years ago
- Dialog State Tracking Challenge 5 (DSTC5)☆38Updated 8 years ago
- Software for unsupervised word segmentation and language model learning using lattices☆45Updated 8 years ago
- An extremely simple Python wrapper for the SRI Language Modeling toolkit☆70Updated 10 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora.☆33Updated 6 years ago
- EMNLP2015_code_Long Short-Term Memory Neural Networks for Chinese Word Segmentation☆77Updated 9 years ago
- ACL2015_code_Gated Recursive Neural Network for Chinese Word Segmentation☆28Updated 9 years ago
- Source code for an ACL2016 paper of Chinese word segmentation☆79Updated 6 years ago
- Attempt at using LSTMs to predict semantic relatedness of sentences (a la Tai et al. in Improved Semantic Representations From Tree-Struc…☆22Updated 9 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- An attempt to implement the TreeLSTM in Theano☆44Updated 8 years ago
- A program to correct non-word spelling error in sentences using ngram MAP Language Models, Noisy Channel Model, Error Confusion Matrix an…☆53Updated 4 years ago
- Experiment with document similarity via Matt Kusner's MWD paper☆24Updated 8 years ago
- ICASSP2017: End-to-end joint learning of natural language understanding and dialogue manager☆74Updated 7 years ago
- Code for paper "End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights"☆65Updated 6 years ago
- A Tensorflow implementation of DSSM (slightly modified).☆24Updated 8 years ago
- Implementation of ULMFit algorithm for text classification via transfer learning☆94Updated 6 years ago
- Source files to replicate experiments in my IWSDS 2016 paper.☆22Updated 8 years ago
- Non-distributional linguistic word vector representations.☆62Updated 7 years ago
- Decomposable Attention Model for Sentence Pair Classification (from https://arxiv.org/abs/1606.01933)☆95Updated 8 years ago
- Dialog State Tracking Challenge 6 (DSTC6)☆54Updated 7 years ago
- Textprep is an analyzing tool for both parallel and non-parallel corpus and its down-stream Natural Language Processing and Machine Trans…☆32Updated 6 years ago
- Word segmentation using neural networks based on package https://github.com/SUTDNLP/LibN3L☆23Updated 9 years ago
- BiLSTM-CRF for sequence labeling in Dynet☆81Updated 7 years ago
- CRFsuite with partial annotation. Used in our paper 'Domain adaptation for CRF-based Chinese word segmentation using free annotations'☆44Updated 8 years ago
- Language modeling scripts based on TensorFlow☆58Updated 5 years ago
- This is the code&dataset for the paper [Hierarchical Memory Networks for Answer Selection on Unknown Words. COLING 2016]☆44Updated 6 years ago
- ☆20Updated 6 years ago
- ☆35Updated 7 years ago
- FOFE NER☆40Updated 7 years ago