cbaziotis / ekphrasis
Ekphrasis is a text processing tool geared towards text from social networks such as Twitter or Facebook. It performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two large corpora: English Wikipedia and a Twitter corpus of 330 million English tweets.
☆662 · Updated 8 months ago
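The hashtag-splitting step mentioned in the description can be illustrated with a minimal, self-contained sketch of statistics-based word segmentation. This is not ekphrasis's own code or API: the `COUNTS` table holds made-up toy counts standing in for real corpus statistics, and `word_logprob` and `segment` are hypothetical helper names chosen for this example. The sketch scores every split under a unigram model and keeps the most probable one.

```python
import math
from functools import lru_cache

# Toy unigram counts (hypothetical values, NOT real Wikipedia/Twitter statistics).
COUNTS = {
    "small": 120, "and": 900, "in": 800, "significant": 40,
    "insignificant": 60, "best": 100, "day": 150, "ever": 90,
}
TOTAL = sum(COUNTS.values())
MAX_WORD_LEN = 20  # longest candidate word considered


def word_logprob(word):
    """Log-probability of a word under the toy unigram model."""
    count = COUNTS.get(word)
    if count is None:
        # Unknown words get a penalty that grows with their length.
        return math.log(1.0 / (TOTAL * 10 ** len(word)))
    return math.log(count / TOTAL)


def segment(text):
    """Split a string without spaces into its most probable word sequence."""
    text = text.lstrip("#").lower()

    @lru_cache(maxsize=None)
    def best(start):
        # Returns (log-probability, words) for the best split of text[start:].
        if start == len(text):
            return 0.0, ()
        candidates = []
        for end in range(start + 1, min(len(text), start + MAX_WORD_LEN) + 1):
            word = text[start:end]
            rest_score, rest_words = best(end)
            candidates.append((word_logprob(word) + rest_score, (word,) + rest_words))
        return max(candidates)

    return list(best(0)[1])


print(segment("#BestDayEver"))
print(segment("smallandinsignificant"))
```

With real corpus counts the same dynamic program would prefer splits made of frequent words, which is why the choice of corpus (for example, tweets versus encyclopedia text) affects how hashtags are split.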
Related projects
Alternatives and complementary repositories for ekphrasis
- TextRank implementation for Python 3. ☆1,249 · Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ☆432 · Updated last year
- A framework to learn cross-lingual word embedding mappings ☆647 · Updated last year
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ☆725 · Updated 3 months ago
- Python Keyphrase Extraction module ☆1,566 · Updated last year
- semi supervised guided topic model with custom guidedLDA ☆499 · Updated 4 years ago
- LexRank algorithm for text summarization ☆229 · Updated 7 months ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy ☆1,352 · Updated 5 months ago
- Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentimen… ☆197 · Updated 6 years ago
- A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.) ☆1,097 · Updated 2 months ago
- Elegant and Easy Tweet Preprocessing in Python ☆305 · Updated last year
- Repository with everything necessary for sentiment analysis and related areas ☆533 · Updated last year
- A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English) ☆686 · Updated 3 years ago
- Pre-trained ELMo Representations for Many Languages ☆1,463 · Updated 3 years ago
- General purpose unsupervised sentence representations ☆1,192 · Updated 2 years ago
- Utilizing BERT for Aspect-Based Sentiment Analysis via Constructing Auxiliary Sentence (NAACL 2019) ☆499 · Updated 2 years ago
- A python tool for evaluating the quality of sentence embeddings. ☆2,087 · Updated 8 months ago
- PyTorch deep learning models for document classification ☆595 · Updated last year
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ☆512 · Updated 3 weeks ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ☆575 · Updated 4 months ago
- sentence embedding by Smooth Inverse Frequency weighting scheme ☆1,083 · Updated 5 years ago
- The SentiWordNet sentiment lexicon ☆322 · Updated 2 years ago
- ☆1,295 · Updated 2 years ago
- Pytorch-Named-Entity-Recognition-with-BERT ☆1,212 · Updated 3 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary. ☆312 · Updated last month
- GSDMM: Short text clustering ☆353 · Updated last year
- A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of lang… ☆1,509 · Updated 4 months ago
- High-accuracy NLP parser with models for 11 languages. ☆872 · Updated 2 years ago
- A curated list of resources dedicated to text summarization ☆1,535 · Updated last year
- Retrofitting Word Vectors to Semantic Lexicons ☆374 · Updated 5 years ago