UniversalDependencies / UD_Italian-ISDT
☆20Updated last week
Alternatives and similar repositories for UD_Italian-ISDT
Users that are interested in UD_Italian-ISDT are comparing it to the libraries listed below
Sorting:
- The Italian NLP Tool☆71Updated 2 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆105Updated 2 years ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 4 years ago
- Compound splitter for German☆105Updated 5 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆45Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆142Updated 5 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated last year
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆75Updated 3 years ago
- Master repo for the UniMorph project, includes the UniMorph schema and annotated data files☆28Updated 5 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 10 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆246Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 9 months ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆83Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆192Updated 4 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆158Updated this week
- A large scale dataset for Question Answering in Italian☆27Updated 6 years ago
- ☆23Updated 3 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- A modern, interlingual wordnet interface for Python☆243Updated this week
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated 3 months ago
- Various utilities for processing the data.☆209Updated last week
- ☆64Updated 2 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆106Updated 2 weeks ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Disambiguate is a tool for training and using state of the art neural WSD models☆60Updated 2 years ago