cbaziotis / ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆664Updated 10 months ago
Alternatives and similar repositories for ekphrasis:
Users that are interested in ekphrasis are comparing it to the libraries listed below
- semi supervised guided topic model with custom guidedLDA☆501Updated 4 years ago
- A framework to learn cross-lingual word embedding mappings☆648Updated last year
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,211Updated last year
- TextRank implementation for Python 3.☆1,250Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation)☆432Updated last year
- Various Algorithms for Short Text Mining☆466Updated last week
- Calculates Word Mover's Distance Insanely Fast☆461Updated last year
- Topic Modeling in Embedding Spaces☆551Updated last year
- Text Similarity☆404Updated 4 years ago
- Code for acl2017 paper "An unsupervised neural attention model for aspect extraction"☆338Updated 5 months ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆311Updated this week
- 💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy☆728Updated 5 months ago
- A python tool for evaluating the quality of sentence embeddings.☆2,091Updated 9 months ago
- End-to-end Neural Coreference Resolution☆525Updated 2 years ago
- ☆228Updated 8 years ago
- InferSent sentence embeddings☆2,285Updated 3 years ago
- GSDMM: Short text clustering☆355Updated 2 years ago
- General purpose unsupervised sentence representations☆1,198Updated 2 years ago
- 🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy☆1,357Updated this week
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,105Updated 4 months ago
- A Python package implementing a new interpretable machine learning model for text classification (with visualization tools for Explainabl…☆338Updated this week
- Retrofitting Word Vectors to Semantic Lexicons☆374Updated 5 years ago
- Python Keyphrase Extraction module☆1,571Updated last year
- Repository with all what is necessary for sentiment analysis and related areas☆534Updated last year
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆583Updated 5 months ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text☆229Updated last year
- PyTorch deep learning models for document classification☆593Updated last year
- Compute Sentence Embeddings Fast!☆618Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- Python port of Moses tokenizer, truecaser and normalizer☆490Updated 7 months ago