pmarcis / nlp-example
The code here provides a simple example of some NLP tasks for plain text processing for English and Latvian
☆7Updated 5 years ago
Alternatives and similar repositories for nlp-example
Users that are interested in nlp-example are comparing it to the libraries listed below
Sorting:
- This is a step by step tutorial for text analyst who want an easy start to basic and and common techniques in NLP, Text Analysis, Machine…☆18Updated 2 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- Distributed infrastructure for Machine Translation web services (using Moses, Python, JSON-RPC/web interface)☆33Updated 3 years ago
- Neural network based lemmatizer for Finnish language☆11Updated 4 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆30Updated 6 years ago
- Harassment Lexicon and Corpus☆30Updated 6 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Sentence specificity prediction☆25Updated 6 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 7 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 6 years ago
- ☆39Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Model training tutorials for the Stanza Python NLP Library☆39Updated 2 years ago
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 6 years ago
- Extracting narrative timelines (i.e. order and timing of events) from text☆20Updated 6 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files☆380Updated 5 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated last year
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 7 years ago
- ☆32Updated 6 years ago
- An evaluation of word-embeddings for classification☆32Updated 6 years ago
- Plan and train German transformer models.☆23Updated 4 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 2 years ago
- Coreference resolution for German☆16Updated 7 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 6 years ago
- Clinical spelling correction with word and character n-gram embeddings.☆74Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago