Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆675Jun 2, 2025Updated 11 months ago
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentimen…☆199Jun 8, 2018Updated 7 years ago
- Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.☆85Jun 21, 2022Updated 3 years ago
- A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).☆23Dec 1, 2018Updated 7 years ago
- Elegant and Easy Tweet Preprocessing in Python☆309Apr 17, 2023Updated 3 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Nov 2, 2023Updated 2 years ago
- Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.☆32Dec 27, 2022Updated 3 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,380Oct 27, 2025Updated 7 months ago
- ☆75Jul 2, 2021Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,107Mar 19, 2024Updated 2 years ago
- NLP, before and after spaCy☆2,242Sep 22, 2023Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆318Nov 15, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data augmentation for NLP☆4,658Jun 24, 2024Updated last year
- Twitter word embeddings generated using Word2Vec and FastText.☆47Aug 17, 2019Updated 6 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,267Jul 24, 2025Updated 10 months ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,049Jan 9, 2024Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆609Jul 22, 2024Updated last year
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 7 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251May 4, 2018Updated 8 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,961Jul 28, 2024Updated last year
- An open-source NLP research library, built on PyTorch.☆11,896Nov 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Python Keyphrase Extraction module☆1,591Jul 12, 2023Updated 2 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,181Aug 28, 2024Updated last year
- A fast, efficient universal vector embedding utility package.☆1,659Aug 3, 2023Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆141Aug 15, 2022Updated 3 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Jul 31, 2023Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆364May 5, 2026Updated 3 weeks ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆358Feb 22, 2022Updated 4 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A generic library for crafting adversarial NLP examples - WIP☆41Oct 26, 2018Updated 7 years ago
- 🧹 Python package for text cleaning☆1,018May 15, 2026Updated 2 weeks ago
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆606Jun 4, 2020Updated 5 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- 🦆 Contextually-keyed word vectors☆1,673Mar 27, 2026Updated 2 months ago
- InferSent sentence embeddings☆2,279Aug 30, 2021Updated 4 years ago
- Cyber Hate detection And tracking on Social mEdia☆30Jan 12, 2023Updated 3 years ago