cbaziotis / ekphrasisLinks
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
β670Updated last month
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below
Sorting:
- semi supervised guided topic model with custom guidedLDAβ510Updated 2 months ago
- π₯ Use the latest Stanza (StanfordNLP) research models directly in spaCyβ735Updated 10 months ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorExβ634Updated 4 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.β748Updated 2 years ago
- Elegant and Easy Tweet Preprocessing in Pythonβ308Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)β594Updated 11 months ago
- β234Updated 8 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)β437Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?β524Updated 8 months ago
- TextRank implementation for Python 3.β1,258Updated 2 years ago
- GSDMM: Short text clusteringβ356Updated 2 years ago
- PyTorch deep learning models for document classificationβ594Updated last year
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Textβ234Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fastβ461Updated last year
- Repository for TweetEvalβ378Updated 3 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.β316Updated 3 weeks ago
- Text Similarityβ400Updated 5 years ago
- Stanford Open Information Extraction made simple!β659Updated last year
- General purpose unsupervised sentence representationsβ1,204Updated 2 years ago
- High-accuracy NLP parser with models for 11 languages.β890Updated 3 years ago
- Data repository for pretrained NLP models and NLP corpora.β1,023Updated 7 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherβ¦β1,232Updated 5 months ago
- Python Keyphrase Extraction moduleβ1,580Updated 2 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.β359Updated 2 years ago
- ConvoKit is a toolkit for extracting conversational features and analyzing social phenomena in conversations. It includes several large cβ¦β584Updated last week
- Catalog of abusive language data (PLoS 2020)β314Updated last year
- Sentence paraphrase generation at the sentence levelβ407Updated 2 years ago
- Compute Sentence Embeddings Fast!β623Updated 2 years ago
- A curated list of resources dedicated to text summarizationβ1,542Updated 2 years ago
- Repository with all what is necessary for sentiment analysis and related areasβ540Updated last year