UniversalDependencies / UD_Italian-ISDT
โ19Updated 3 months ago
Related projects: โ
- The Italian NLP Toolโ70Updated last year
- ๐ฎ๐น Italian BERT and ELECTRA models (incl. evaluation)โ17Updated last year
- UmBERTo: an Italian Language Model trained with Whole Word Masking.โ104Updated last year
- AlBERTo the first italian BERT model for Twitter languange understandingโ70Updated 4 years ago
- A large scale dataset for Question Answering in Italianโ24Updated 5 years ago
- GilBERTo: A pretrained language model based on RoBERTa for Italianโ73Updated 4 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", preโฆโ82Updated 3 years ago
- โ15Updated 7 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more โฆโ111Updated 4 months ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.โ76Updated 3 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT modelsโ153Updated last year
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern striโฆโ22Updated 2 years ago
- German Morphological Analyzerโ45Updated 2 years ago
- UIMA CAS processing library written in Pythonโ84Updated 4 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐นโ30Updated 3 months ago
- โ13Updated 4 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.โ149Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doโฆโ72Updated 2 months ago
- Hunspell extension for spaCy 2.0.โ94Updated last month
- A tokenizer and sentence splitter for German and English web and social media texts.โ135Updated last month
- spaCy + UDPipeโ159Updated 2 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpusโ17Updated 3 months ago
- Compiled tools, datasets, and other resources for historical text normalization.โ16Updated 5 years ago
- Official repository of the Hate Speech Detection Tasks at Evalitaโ12Updated 3 years ago
- Open German WordNetโ87Updated 7 months ago
- A python wrapper for the multilingual temporal tagger HeidelTime.โ26Updated 2 years ago
- Dutch word embeddings, trained on a large collection of Dutch social media messages and news/blog/forum posts.โ43Updated 2 years ago
- UD Greekโ21Updated 4 months ago
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).โ24Updated 2 months ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "Whatโs so special about BERTโs โฆโ133Updated last year