cbaziotis / ekphrasisLinks
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆673Updated 8 months ago
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below
Sorting:
- semi supervised guided topic model with custom guidedLDA☆513Updated 9 months ago
- PyTorch deep learning models for document classification☆596Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆602Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆439Updated 2 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text☆233Updated 2 years ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆744Updated last year
- TextRank implementation for Python 3.☆1,269Updated 2 years ago
- Repository for TweetEval☆393Updated 3 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆747Updated 3 years ago
- Elegant and Easy Tweet Preprocessing in Python☆309Updated 2 years ago
- Text Similarity☆399Updated 5 years ago
- Python Keyphrase Extraction module☆1,586Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,259Updated 6 months ago
- General purpose unsupervised sentence representations☆1,208Updated 3 years ago
- A comparison and discussion of different NLP methods for 5-class sentiment classification on the SST-5 dataset.☆172Updated 9 months ago
- Code to obtain the CNN / Daily Mail dataset (non-anonymized) for summarization☆657Updated 3 years ago
- The SentiWordNet sentiment lexicon☆335Updated 3 years ago
- A framework to learn cross-lingual word embedding mappings☆652Updated 2 years ago
- Sentence paraphrase generation at the sentence level☆408Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆321Updated last year
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang…☆1,559Updated 7 months ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆319Updated last week
- Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive…☆439Updated 8 months ago
- High-accuracy NLP parser with models for 11 languages.☆903Updated 4 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆368Updated 3 years ago
- Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for employing Transformer models in text classif…☆312Updated 5 years ago
- Compute Sentence Embeddings Fast!☆624Updated 2 years ago
- Repository with all what is necessary for sentiment analysis and related areas☆544Updated 2 weeks ago
- Abstractive summarisation using Bert as encoder and Transformer Decoder☆412Updated 2 years ago
- ☆234Updated 9 years ago