cbaziotis / ekphrasis
Ekphrasis is a text processing tool geared towards text from social networks such as Twitter or Facebook. It performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two large corpora: English Wikipedia and Twitter (330 million English tweets).
☆671 · Updated 4 months ago
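
The word segmentation step mentioned in the description (splitting a hashtag into words using word statistics) can be illustrated with a small, self-contained sketch. This is not ekphrasis's own code or API: the `UNIGRAM_COUNTS` table, the unknown-word penalty, and the `segment` function are hypothetical stand-ins, and a real tool would use counts gathered from large corpora rather than a toy dictionary.

```python
import math
from functools import lru_cache

# Toy unigram counts (hypothetical values, standing in for real corpus statistics).
UNIGRAM_COUNTS = {
    "the": 500, "a": 400, "be": 200, "day": 120,
    "best": 80, "ever": 60, "eve": 10,
}
TOTAL = sum(UNIGRAM_COUNTS.values())
MAX_WORD_LEN = 20  # assumed upper bound on the length of a single word


def word_logprob(word):
    """Log-probability of a word under the toy unigram model."""
    count = UNIGRAM_COUNTS.get(word)
    if count is None:
        # Unknown words get a penalty that grows with their length,
        # so known words are preferred whenever they fit.
        return math.log(10.0 / (TOTAL * 10 ** len(word)))
    return math.log(count / TOTAL)


def segment(text):
    """Split text into the most probable word sequence (dynamic programming)."""
    text = text.lstrip("#").lower()

    @lru_cache(maxsize=None)
    def best(start):
        # Returns (score, words) for the best segmentation of text[start:].
        if start == len(text):
            return 0.0, ()
        candidates = []
        for end in range(start + 1, min(len(text), start + MAX_WORD_LEN) + 1):
            head = text[start:end]
            tail_score, tail_words = best(end)
            candidates.append((word_logprob(head) + tail_score, (head,) + tail_words))
        return max(candidates)

    return list(best(0)[1])


print(segment("#BestDayEver"))  # ['best', 'day', 'ever']
```

With richer statistics the same search tends to recover sensible splits for unseen hashtags; with a toy table like this one, anything outside the dictionary falls back to the length penalty.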
Alternatives and similar repositories for ekphrasis
Users who are interested in ekphrasis are comparing it to the libraries listed below.
- semi supervised guided topic model with custom guidedLDA ☆511 · Updated 6 months ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ☆748 · Updated 3 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ☆599 · Updated last year
- Repository with all what is necessary for sentiment analysis and related areas ☆540 · Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆739 · Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ☆439 · Updated 2 years ago
- Elegant and Easy Tweet Preprocessing in Python ☆309 · Updated 2 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text ☆235 · Updated 2 years ago
- Repository for TweetEval ☆386 · Updated 3 years ago
- Text Similarity ☆398 · Updated 5 years ago
- General purpose unsupervised sentence representations ☆1,204 · Updated 3 years ago
- GSDMM: Short text clustering ☆357 · Updated 2 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆317 · Updated 2 months ago
- Data repository for pretrained NLP models and NLP corpora. ☆1,039 · Updated 7 years ago
- analyze text with empath ☆337 · Updated 8 years ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx ☆640 · Updated 4 years ago
- Sentence paraphrase generation at the sentence level ☆408 · Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆526 · Updated 11 months ago
- LexRank algorithm for text summarization ☆231 · Updated last year
- The SentiWordNet sentiment lexicon ☆332 · Updated 3 years ago
- TextRank implementation for Python 3. ☆1,265 · Updated 2 years ago
- A framework to learn cross-lingual word embedding mappings ☆649 · Updated 2 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,249 · Updated 3 months ago
- Compute Sentence Embeddings Fast! ☆623 · Updated 2 years ago
- Catalog of abusive language data (PLoS 2020) ☆317 · Updated last year
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English) ☆348 · Updated 2 years ago
- Steam review texting embedding analysis ☆143 · Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fast ☆462 · Updated 2 years ago
- A sentence segmenter that actually works! ☆305 · Updated 5 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) ☆1,215 · Updated last year