rsling / texrex
texrex web page cleaning & ClaraX random walk crawler
☆11Updated 3 years ago
Alternatives and similar repositories for texrex
Users that are interested in texrex are comparing it to the libraries listed below
Sorting:
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated 2 weeks ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Named entity annotation tool☆28Updated last year
- Named Entity Recognition☆19Updated last month
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated 2 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated this week
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Updated 6 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated 3 months ago
- You Actually Look Twice At it☆35Updated 3 months ago
- Editor for aligned parallel texts (personal desktop application).☆19Updated 4 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- ☆11Updated 4 years ago
- ☆16Updated 10 years ago
- Lexical data at Unicode☆68Updated 8 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- Format conversion and graphical representation of [Universal Dependencies](http://universaldependencies.org) trees.☆12Updated 8 months ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆36Updated 6 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆35Updated last month
- PhiloLogic4☆38Updated 5 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Program used to split text into segments☆26Updated 6 months ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 4 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Updated 10 months ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 9 months ago