WING-NUS / nus-sms-corpus
This is the distribution point for the NUS SMS Corpus as described and updated from This is a corpus of SMS (Short Message Service) messages collected for research at the Department of Computer Science at the National University of Singapore. This dataset consists of 67,093 SMS messages taken from the corpus on Mar 9, 2015. The messages largely …
☆22Updated last year
Alternatives and similar repositories for nus-sms-corpus
Users who are interested in nus-sms-corpus are comparing it to the libraries listed below.
- Convert word2vec vectors between binary and plain text format☆136Updated 5 years ago
- ☆97Updated 3 years ago
- ☆214Updated 6 years ago
- SemCor and Masc documents annotated with NOAD word senses.☆184Updated 5 years ago
- Different datasets for developing and testing keyword extraction algorithms☆109Updated 10 years ago
- SippyCup is a simple semantic parser, written in Python, created purely for didactic purposes.☆220Updated 6 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆376Updated 6 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆139Updated 2 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 8 months ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads.☆240Updated 6 years ago
- Practical Natural Language Processing Tools for Humans. Dependency Parsing, Syntactic Constituent Parsing, Semantic Role Labeling, Named …☆194Updated 7 years ago
- Concatenated Power Mean Embeddings as Universal Cross-Lingual Sentence Representations☆186Updated 4 years ago
- C++ implementation of the Brown word clustering algorithm.☆427Updated last year
- It is a question-generator model. It takes text and an answer as input and outputs a question.☆170Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies☆70Updated 6 years ago
- Converting GloVe vectors into word2vec format for easy usage with Gensim☆112Updated 6 years ago
- Formerly known as code.google.com/p/1-billion-word-language-modeling-benchmark☆448Updated 9 years ago
- Python wrapper for Stanford CoreNLP☆355Updated 4 years ago
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 3 years ago
- Stanford NLP group's shared Python tools.☆137Updated 7 years ago
- Python 3 Spelling Corrector☆177Updated last year
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆259Updated 8 years ago
- Fast, DB Backed pretrained word embeddings for natural language processing.☆222Updated 3 months ago
- Simple Wikipedia plain text extractor with article link annotations and Hadoop support.☆103Updated 14 years ago
- Uses Recurrent Neural Network (LSTM/GRU/basic_RNN units) for summarization of Amazon reviews☆132Updated 7 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 7 years ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014☆46Updated 6 years ago
- Next Utterance Classification☆135Updated 7 years ago