nlpaueb / gr-nlp-toolkit
The Greek NLP toolkit for Python. Supports NER/DP/POS Tagging/Greeklish-to-Greek Transliteration. Visit the playground here: https://huggingface.co/spaces/AUEB-NLP/greek-nlp-toolkit-demo (paper presented at COLING 2025)
☆53Updated last week
Alternatives and similar repositories for gr-nlp-toolkit:
Users that are interested in gr-nlp-toolkit are comparing it to the libraries listed below
- A Greek edition of BERT pre-trained language model☆142Updated 5 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆243Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆211Updated last month
- A python package for text preprocessing task in natural language processing.☆63Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆320Updated last month
- Clustering sentence embeddings to extract message intent☆169Updated 3 years ago
- ☆52Updated 10 months ago
- Text analysis with networks.☆286Updated 8 months ago
- A module to compute textual lexical richness (aka lexical diversity).☆98Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆104Updated 9 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆255Updated 2 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆156Updated 2 years ago
- A Dataset of German Legal Documents for Named Entity Recognition☆163Updated 2 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆56Updated 11 months ago
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- Dataset for Emotion Recognition Research☆204Updated 2 years ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆147Updated last year
- Repository for Zheng and Guha et al., 2021, "When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Data…☆86Updated last year
- Creating class-based TF-IDF matrices☆82Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 7 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆397Updated 3 years ago
- A spaCy wrapper for DBpedia Spotlight☆107Updated last year
- A Dutch RoBERTa-based language model☆198Updated 9 months ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 9 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 6 months ago
- Unannotated Spanish 3 Billion Words Corpora☆94Updated 2 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆135Updated last year