proycon / foliapy
An extensive Python library for working with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation used in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
☆18 Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for foliapy
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated 8 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio… ☆68 Updated last year
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆15 Updated 5 years ago
- A tool to extract canonical references from text. ☆20 Updated 3 years ago
- Simple spaCy-based concept extraction API, involving a dictionary of relevant concepts. ☆10 Updated 5 years ago
- Ontolex modules ☆30 Updated last year
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆60 Updated 5 months ago
- Tool for generating filtered Wikidata RDF exports ☆37 Updated 2 years ago
- Wrapper for DKPro Core to extract linguistic information from books. ☆16 Updated 2 years ago
- Python API for KB data-services ☆18 Updated 4 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an… ☆17 Updated 3 weeks ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆27 Updated last year
- A deep learning model for extracting references from text ☆25 Updated last year
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 Updated 2 years ago
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow. ☆14 Updated 2 years ago
- ☆16 Updated 5 years ago
- Schema for modelling parliamentary debates ☆21 Updated 2 years ago
- Tools for TICCL ☆14 Updated last month
- This repository contains simple code in Python to help historians prepare data for quantitative analysis & visualization. Visit the follo… ☆27 Updated 8 months ago
- Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl,… ☆74 Updated this week
- Import entities from another Wikibase instance (e.g. Wikidata) ☆13 Updated last year
- Python bindings to the Dutch NLP tool Frog (POS tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser… ☆47 Updated last week
- Topic Modeling Workflow in Python ☆16 Updated last year
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB). ☆16 Updated this week
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature ☆68 Updated 2 weeks ago
- Netherlands eScience Center - Shifting Concepts Through Time project ☆26 Updated 2 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆20 Updated last year
- 📦 The Knowledge Box - A data dependency management framework to help users to publish, find and install data models ☆44 Updated 10 months ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆39 Updated last year
- A module for Omeka S that provides an API for the Neatline 3 single page application ☆13 Updated last year