Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
☆676Jun 2, 2025Updated last year
Alternatives and similar repositories for ekphrasis
Users that are interested in ekphrasis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentimen…☆199Jun 8, 2018Updated 8 years ago
- Deep-learning models of NTUA-SLP team submitted in SemEval 2018 tasks 1, 2 and 3.☆85Jun 21, 2022Updated 3 years ago
- A service for downloading twitter streaming data. You can save the data either in text files on disk, or in a database (MongoDB).☆23Dec 1, 2018Updated 7 years ago
- Elegant and Easy Tweet Preprocessing in Python☆309Apr 17, 2023Updated 3 years ago
- Guide for the slp group on how to use the Grnet cluster☆11Apr 16, 2020Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Nov 2, 2023Updated 2 years ago
- Deep-learning Transfer Learning models of NTUA-SLP team submitted at the IEST of WASSA 2018 at EMNLP 2018.☆32Dec 27, 2022Updated 3 years ago
- POS tagging models for Hindi English Code Mixed Tweets☆11Aug 1, 2018Updated 7 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,376Oct 27, 2025Updated 7 months ago
- ☆75Jul 2, 2021Updated 4 years ago
- A python tool for evaluating the quality of sentence embeddings.☆2,108Mar 19, 2024Updated 2 years ago
- NLP, before and after spaCy☆2,241Sep 22, 2023Updated 2 years ago
- Fixes contractions such as `you're` to `you are`☆318Nov 15, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Data augmentation for NLP☆4,658Jun 12, 2026Updated last week
- Twitter word embeddings generated using Word2Vec and FastText.☆47Aug 17, 2019Updated 6 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,271Jul 24, 2025Updated 10 months ago
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList☆2,048Jan 9, 2024Updated 2 years ago
- BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)☆608Jul 22, 2024Updated last year
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 7 years ago
- Neat (Neural Attention) Vision, is a visualization tool for the attention mechanisms of deep-learning models for Natural Language Process…☆251May 4, 2018Updated 8 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,958Jul 28, 2024Updated last year
- An open-source NLP research library, built on PyTorch.☆11,892Nov 22, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Python Keyphrase Extraction module☆1,592Jul 12, 2023Updated 2 years ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,183Aug 28, 2024Updated last year
- A fast, efficient universal vector embedding utility package.☆1,662Aug 3, 2023Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆141Aug 15, 2022Updated 3 years ago
- ☆55Mar 24, 2022Updated 4 years ago
- Python library for Natural Language Preprocessing (NLPre)☆190Jul 31, 2023Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆367May 5, 2026Updated last month
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A generic library for crafting adversarial NLP examples - WIP☆41Oct 26, 2018Updated 7 years ago
- 🧹 Python package for text cleaning☆1,022May 15, 2026Updated last month
- The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020☆606Jun 4, 2020Updated 6 years ago
- PyTorch source code of NAACL 2021 paper "Improving the Lexical Ability of Pretrained Language Models for Unsupervised Neural Machine Tran…☆18Oct 18, 2022Updated 3 years ago
- 🦆 Contextually-keyed word vectors☆1,672Mar 27, 2026Updated 2 months ago
- InferSent sentence embeddings☆2,279Aug 30, 2021Updated 4 years ago
- Cyber Hate detection And tracking on Social mEdia☆30Jan 12, 2023Updated 3 years ago