proycon/foliapy

proycon / foliapy

An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.

☆18

Alternatives and similar repositories for foliapy

Users that are interested in foliapy are comparing it to the libraries listed below.

proycon / spacy2folia
Use spaCy for NLP and output to the FoLiA XML format.
☆12 · Feb 27, 2024 · Updated 2 years ago
CentreForDigitalHumanities / tscan
T-scan: an analysis tool for Dutch texts to assess the complexity of the text, based on original work by Rogier Kraf
☆19 · May 28, 2025 · Updated last year
LanguageMachines / ticcltools
Tools for TICCL
☆14 · Dec 12, 2025 · Updated 7 months ago
LanguageMachines / libfolia
FoLiA library for C++
☆18 · Mar 25, 2026 · Updated 3 months ago
LanguageMachines / PICCL
A set of workflows for corpus building through OCR, post-correction and normalisation
☆50 · Sep 7, 2022 · Updated 3 years ago
proycon / LaMachine
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…
☆69 · Sep 11, 2023 · Updated 2 years ago
martinreynaert / TICCL
Text-Induced Corpus Clean-up
☆20 · Jun 20, 2023 · Updated 3 years ago
textexploration / mtas
Multi Tier Annotation Search
☆12 · Jul 10, 2026 · Updated last week
LanguageMachines / ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…
☆72 · Jun 15, 2026 · Updated last month
CLARIAH / software-quality-guidelines
Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)
☆18 · May 29, 2022 · Updated 4 years ago
ybracke / transnormer
A lexical normalizer for historical spelling variants using a transformer architecture.
☆10 · Mar 12, 2025 · Updated last year
performant-software / DM
DM is an environment for the study and annotation of images and texts. It is a suite of tools, enabling scholars to gather and organize t…
☆19 · Dec 10, 2018 · Updated 7 years ago
performant-software / faircopy
FairCopy is a word processor for the humanities scholar.
☆16 · May 26, 2026 · Updated last month
CentreForDigitalHumanities / texcavator
Text mining on the Royal Library newspaper corpus
☆11 · Dec 3, 2025 · Updated 7 months ago
proycon / folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…
☆66 · Dec 9, 2025 · Updated 7 months ago
instituutnederlandsetaal / MBMP-morphological-parser
A memory-based morphological parser for Python
☆16 · Oct 12, 2012 · Updated 13 years ago
suzanv / PairwisePreferenceLearning
Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…
☆14 · Jul 12, 2017 · Updated 9 years ago
stjaenicke / TRAViz
Text Re-use Alignment Visualization
☆38 · Nov 8, 2017 · Updated 8 years ago
knaw-huc / pagexml
☆17 · Jan 16, 2026 · Updated 6 months ago
LanguageMachines / timbl
TiMBL implements several memory-based learning algorithms.
☆55 · Jul 6, 2026 · Updated 2 weeks ago
marijnkoolen / fuzzy-search
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆23 · Jun 29, 2026 · Updated 3 weeks ago
cohure / CoHuRe
☆27 · Feb 2, 2021 · Updated 5 years ago
gariepyalex / reddit-nlp
An exploration on natural language processing of reddit comments
☆10 · Nov 8, 2017 · Updated 8 years ago
ambiverse-nlu / ambiverse-kg
Web Service wrapper for accessing the AmbiverseNLU KG stored in Neo4j
☆12 · Nov 16, 2022 · Updated 3 years ago
stjaenicke / GeoTemCo
A tool for the comparative visualization of geospatial-temporal data.
☆22 · Mar 21, 2014 · Updated 12 years ago
ifding / flex-bison
flex & bison (Lexical Analysis and Parsing)
☆12 · May 18, 2018 · Updated 8 years ago
performant-software / juxta-service
Juxta Web Service
☆33 · Jul 7, 2022 · Updated 4 years ago
rtrppl / cuckoo-search
Content-based search for Elfeed.
☆14 · Oct 4, 2025 · Updated 9 months ago
AttackingOrDefending / pydraughts
A draughts (checkers) library for Python with move generation, PDN reading and writing, engine communication and balloted openings
☆20 · Jan 12, 2025 · Updated last year
jwiegley / org2tc
Convert org work clock entries to timeclock.el entries
☆13 · Mar 19, 2026 · Updated 4 months ago
cltl / svm_wsd
Word Sense Disambiguation system developed on the DutchSemCor project using Support Vector Machines. The input is plain text, and the out…
☆12 · Feb 5, 2019 · Updated 7 years ago
tklauser / libtar
Maintenance repo. Mirror of https://repo.or.cz/libtar.git with additional modifications
☆47 · Jun 20, 2021 · Updated 5 years ago
robertdebock / docker-alpine-openrc
Container to test Ansible roles in, including capabilities to use openrc facilities
☆11 · Sep 24, 2025 · Updated 9 months ago
BIMSBbioinfo / guix-bimsb-nonfree
GNU Guix package definitions for proprietary software, or software with unclear licenses.
☆12 · Feb 20, 2025 · Updated last year
emacsmirror / consult-recoll
Recoll queries using consult
☆15 · Apr 6, 2025 · Updated last year
ifitzpat / ob-kubectl
Org babel extension to apply kubectl to org babel source blocks.
☆15 · Feb 13, 2020 · Updated 6 years ago
lepisma / onnx.el
ONNX runtime for Emacs Lisp
☆14 · Aug 23, 2025 · Updated 10 months ago
kurtjx / n3-mode-for-emacs
A major mode for Emacs for editing N3 and Turtle RDF
☆14 · Dec 13, 2017 · Updated 8 years ago
jaantollander / dotfiles-arch
My Arch Linux setup for a lean, secure, command-line driven development environment with modular configuration management using shell scr…
☆10 · Jan 22, 2023 · Updated 3 years ago