cbaziotis / ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆666Updated last year
Alternatives and similar repositories for ekphrasis:
Users that are interested in ekphrasis are comparing it to the libraries listed below
- semi supervised guided topic model with custom guidedLDA☆504Updated 4 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text☆231Updated last year
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆589Updated 8 months ago
- GSDMM: Short text clustering☆355Updated 2 years ago
- Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentimen…☆197Updated 6 years ago
- Code for acl2017 paper "An unsupervised neural attention model for aspect extraction"☆338Updated 7 months ago
- End-to-end Neural Coreference Resolution☆524Updated 2 years ago
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆435Updated last year
- Python Implementations of Word Sense Disambiguation (WSD) Technologies.☆746Updated 2 years ago
- Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence (NAACL 2019)☆509Updated 3 years ago
- sentence embedding by Smooth Inverse Frequency weighting scheme☆1,085Updated 5 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆250Updated 6 years ago
- Repository for TweetEval☆365Updated 2 years ago
- Keyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and othe…☆114Updated 5 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,100Updated last year
- Repository with all what is necessary for sentiment analysis and related areas☆539Updated last year
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,205Updated 5 months ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,225Updated last month
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆514Updated 4 months ago
- LexRank algorithm for text summarization☆231Updated 11 months ago
- Various Algorithms for Short Text Mining☆469Updated this week
- PyTorch deep learning models for document classification☆595Updated last year
- This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, an…☆557Updated 3 years ago
- Officially supported AllenNLP models☆538Updated 2 years ago
- Calculates Word Mover's Distance Insanely Fast☆461Updated last year
- Code for paper Fine-tune BERT for Extractive Summarization☆1,481Updated 3 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆351Updated 2 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆375Updated 5 years ago
- Applying NLP transfer learning techniques to predict Tweet stance toward a topic☆107Updated 6 years ago
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆340Updated 2 years ago