Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆675Jun 2, 2025Updated 10 months ago
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentimen…☆200Jun 8, 2018Updated 7 years ago
- Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.☆85Jun 21, 2022Updated 3 years ago
- A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).☆23Dec 1, 2018Updated 7 years ago
- Elegant and Easy Tweet Preprocessing in Python☆309Apr 17, 2023Updated 2 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Nov 2, 2023Updated 2 years ago
- Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.☆32Dec 27, 2022Updated 3 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,363Oct 27, 2025Updated 5 months ago
- ☆75Jul 2, 2021Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,106Mar 19, 2024Updated 2 years ago
- NLP, before and after spaCy☆2,239Sep 22, 2023Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆319Nov 15, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data augmentation for NLP☆4,656Jun 24, 2024Updated last year
- Twitter word embeddings generated using Word2Vec and FastText.☆47Aug 17, 2019Updated 6 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,266Jul 24, 2025Updated 8 months ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,050Jan 9, 2024Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆607Jul 22, 2024Updated last year
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 7 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251May 4, 2018Updated 7 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,974Jul 28, 2024Updated last year
- An open-source NLP research library, built on PyTorch.☆11,893Nov 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Python Keyphrase Extraction module☆1,592Jul 12, 2023Updated 2 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,182Aug 28, 2024Updated last year
- A fast, efficient universal vector embedding utility package.☆1,658Aug 3, 2023Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆139Aug 15, 2022Updated 3 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Jul 31, 2023Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆363Mar 20, 2026Updated 3 weeks ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A generic library for crafting adversarial NLP examples - WIP☆41Oct 26, 2018Updated 7 years ago
- 🧹 Python package for text cleaning☆1,008Jan 28, 2026Updated 2 months ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆606Jun 4, 2020Updated 5 years ago
- 🦆 Contextually-keyed word vectors☆1,673Mar 27, 2026Updated 2 weeks ago
- InferSent sentence embeddings☆2,280Aug 30, 2021Updated 4 years ago
- Cyber Hate detection And tracking on Social mEdia☆30Jan 12, 2023Updated 3 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Oct 1, 2015Updated 10 years ago