cbaziotis / ekphrasisLinks
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆672Updated 6 months ago
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below
Sorting:
- semi supervised guided topic model with custom guidedLDA☆512Updated 7 months ago
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆741Updated last year
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆600Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆319Updated 4 months ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆438Updated 2 years ago
- TextRank implementation for Python 3.☆1,267Updated 2 years ago
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆346Updated 3 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆527Updated last year
- GSDMM: Short text clustering☆357Updated 2 years ago
- Elegant and Easy Tweet Preprocessing in Python☆310Updated 2 years ago
- Repository with all what is necessary for sentiment analysis and related areas☆541Updated 2 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆748Updated 3 years ago
- The SentiWordNet sentiment lexicon☆333Updated 3 years ago
- A framework to learn cross-lingual word embedding mappings☆651Updated 2 years ago
- Repository for TweetEval☆388Updated 3 years ago
- Catalog of abusive language data (PLoS 2020)☆320Updated last year
- Python Keyphrase Extraction module☆1,586Updated 2 years ago
- PyTorch deep learning models for document classification☆596Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,402Updated 3 weeks ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆367Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fast☆462Updated 2 years ago
- General purpose unsupervised sentence representations☆1,204Updated 3 years ago
- End-to-end Neural Coreference Resolution☆526Updated 3 years ago
- Stanford Open Information Extraction made simple!☆670Updated last year
- ☆234Updated 8 years ago
- LexRank algorithm for text summarization☆231Updated last year
- High-accuracy NLP parser with models for 11 languages.☆900Updated 3 years ago
- analyze text with empath☆338Updated 8 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text☆234Updated 2 years ago
- A curated list of resources dedicated to text summarization☆1,541Updated 2 years ago