cbaziotis / ekphrasis
Ekphrasis is a text processing tool geared towards text from social networks such as Twitter or Facebook. It performs tokenization, word normalization, word segmentation (for splitting hashtags), and spell correction, using word statistics from two large corpora: English Wikipedia and Twitter (330 million English tweets).
★671 · Updated 4 months ago
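The word segmentation step mentioned above (splitting a hashtag into words using word statistics) can be illustrated with a minimal sketch. This is not ekphrasis's own code or API: the `segment` and `word_logprob` functions, the `TOY_COUNTS` table, and the unknown-word penalty are all hypothetical, and the counts are made-up stand-ins for statistics that would really come from a large corpus. The sketch picks the split whose words have the highest combined unigram probability.

```python
import math
from functools import lru_cache

# Hypothetical toy unigram counts; a real tool would load these
# from corpus statistics (e.g. Wikipedia or tweets).
TOY_COUNTS = {
    "make": 50, "america": 30, "great": 40, "again": 35,
    "a": 100, "me": 60, "gain": 10, "eat": 20, "rica": 1,
}
TOTAL = sum(TOY_COUNTS.values())


def word_logprob(word):
    """Log-probability of a word under the toy unigram model."""
    count = TOY_COUNTS.get(word)
    if count is None:
        # Assumed penalty: unknown words get less likely as they get longer.
        return math.log(1.0 / (TOTAL * 10 ** len(word)))
    return math.log(count / TOTAL)


def segment(text, max_word_len=20):
    """Split text into the most probable word sequence (memoized recursion)."""
    text = text.lower()

    @lru_cache(maxsize=None)
    def best(start):
        # Returns (score, words) for the best split of text[start:].
        if start == len(text):
            return 0.0, ()
        candidates = []
        for end in range(start + 1, min(len(text), start + max_word_len) + 1):
            head = text[start:end]
            tail_score, tail_words = best(end)
            candidates.append((word_logprob(head) + tail_score, (head,) + tail_words))
        return max(candidates)

    return list(best(0)[1])


if __name__ == "__main__":
    print(segment("makeamericagreatagain"))
```

With these toy counts, a run of known words scores higher than any split containing an unknown fragment, which is why longer dictionary words such as "again" win over "a" + "gain". Real corpus statistics would change the scores but not the overall approach.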
Alternatives and similar repositories for ekphrasis
Users who are interested in ekphrasis are comparing it to the libraries listed below.
- Semi-supervised guided topic model with custom guidedLDA ★511 · Updated 5 months ago
- 🔥 Use the latest Stanza (StanfordNLP) research models directly in spaCy ★739 · Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested Python dictionary. ★317 · Updated 2 months ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020) ★596 · Updated last year
- EmbedRank: Unsupervised Keyphrase Extraction using Sentence Embeddings (official implementation) ★439 · Updated 2 years ago
- Python Implementations of Word Sense Disambiguation (WSD) Technologies. ★749 · Updated 3 years ago
- PyTorch deep learning models for document classification ★595 · Updated 2 years ago
- A Survey and Experiments on Annotated Corpora for Emotion Classification in Text ★235 · Updated 2 years ago
- Text Similarity ★400 · Updated 5 years ago
- Compute Sentence Embeddings Fast! ★622 · Updated 2 years ago
- GSDMM: Short text clustering ★357 · Updated 2 years ago
- Catalog of abusive language data (PLoS 2020) ★317 · Updated last year
- Python Keyphrase Extraction module ★1,582 · Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how? ★526 · Updated 11 months ago
- Calculates Word Mover's Distance Insanely Fast ★462 · Updated 2 years ago
- End-to-end Neural Coreference Resolution ★526 · Updated 3 years ago
- The SentiWordNet sentiment lexicon ★332 · Updated 3 years ago
- LexRank algorithm for text summarization ★230 · Updated last year
- Elegant and Easy Tweet Preprocessing in Python ★310 · Updated 2 years ago
- Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx ★639 · Updated 4 years ago
- A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ★1,246 · Updated 2 months ago
- Repository for TweetEval ★386 · Updated 3 years ago
- A framework to learn cross-lingual word embedding mappings ★647 · Updated 2 years ago
- Repository with all that is necessary for sentiment analysis and related areas ★540 · Updated last year
- Abstractive summarisation using Bert as encoder and Transformer Decoder ★413 · Updated 2 years ago
- A sentence segmenter that actually works! ★305 · Updated 5 years ago
- General purpose unsupervised sentence representations ★1,205 · Updated 3 years ago
- TextRank implementation for Python 3. ★1,264 · Updated 2 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python ★272 · Updated 2 years ago
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English) ★348 · Updated 2 years ago