UniversalDependencies / UD_French-GSDLinks
☆25Updated last month
Alternatives and similar repositories for UD_French-GSD
Users that are interested in UD_French-GSD are comparing it to the libraries listed below
Sorting:
- NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser☆51Updated 7 months ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆391Updated last month
- spaCy + UDPipe☆165Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆153Updated last month
- Named Entity Recognition data for Europeana Newspapers☆173Updated 2 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆319Updated this week
- UIMA CAS processing library written in Python☆91Updated 2 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆84Updated last year
- Various utilities for processing the data.☆216Updated this week
- Quickly extract multi-word phrases from a corpus☆195Updated 5 years ago
- coFR: COreference resolution tool for FRench (and singletons).☆26Updated 5 years ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆249Updated 2 years ago
- Language independent truecaser in Python.☆159Updated 4 years ago
- CONLL-U to Pandas DataFrame☆31Updated 8 years ago
- Alignment and annotation for comparable documents.☆22Updated 7 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Updated 6 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 7 years ago
- English data☆218Updated 3 weeks ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆150Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆208Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 4 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 5 years ago
- Python framework for processing Universal Dependencies data☆58Updated 2 weeks ago
- German lemmatization with IWNLP as extension for spaCy☆26Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 4 months ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆57Updated 3 weeks ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", pre…☆84Updated 4 years ago