PeterisP / LVTagger
☆17Updated last month
Related projects ⓘ
Alternatives and complementary repositories for LVTagger
- Full Stack of Latvian Language Resources for Natural Language Understanding (NLU) and Generation (NLG)☆14Updated 2 years ago
- A Named-Entity Recogniser based on Grobid.☆49Updated last month
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆54Updated this week
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆60Updated this week
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆110Updated 4 months ago
- Detect and align similar passages☆88Updated 2 months ago
- A Java UIMA-based toolbox for multilingual and efficient terminology extraction an multilingual term alignment☆38Updated 7 years ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- A scalable and shareable repository of text annotation☆20Updated this week
- A tool for automatic spelling normalization☆20Updated 3 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 4 months ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆65Updated last month
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 2 years ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,…☆74Updated this week
- Multi Tier Annotation Search☆26Updated 3 years ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆40Updated this week
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆17Updated 5 months ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 6 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- 📂 Additional lookup tables and data resources for spaCy☆98Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆111Updated 6 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆50Updated 4 years ago
- WordNet-LMF formats☆20Updated this week
- Named Entity Recognition data for Europeana Newspapers☆172Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 3 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆91Updated last year
- CLI for loading Wikidata subsets (or all of it) into Elasticsearch☆67Updated 2 years ago
- ☆18Updated this week
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated 7 months ago