proycon/folia

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/proycon/folia)

proycon / folia

FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…

☆66

Alternatives and similar repositories for folia

Users that are interested in folia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

proycon / flat
View on GitHub
FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…
☆113Jan 24, 2025Updated last year
instituutnederlandsetaal / OpenConvert
View on GitHub
Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)
☆23Feb 11, 2022Updated 4 years ago
meertensinstituut / mtas
View on GitHub
Multi Tier Annotation Search
☆24May 12, 2021Updated 5 years ago
emanjavacas / cosycat
View on GitHub
Collaborative Synchronized Corpus Annotation Tool
☆10Dec 31, 2018Updated 7 years ago
opener-project / coreference-base
View on GitHub
Co-reference resolution for the English language.
☆17Jan 12, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
newsreader / eso-and-ceo
View on GitHub
Events and Situations Ontology
☆14Apr 20, 2018Updated 8 years ago
qurator-spk / sbb_ocr_postcorrection
View on GitHub
Two-Step Approach to OCR Post-Correction
☆14May 24, 2024Updated 2 years ago
tmbdev-teaching / teaching-nlpa
View on GitHub
Course in Natural Language Processing and Applications
☆10Oct 4, 2022Updated 3 years ago
LanguageMachines / ticcltools
View on GitHub
Tools for TICCL
☆14Dec 12, 2025Updated 7 months ago
proycon / LaMachine
View on GitHub
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…
☆69Sep 11, 2023Updated 2 years ago
Princeton-CDH / piffle
View on GitHub
python library for working with IIIF Image and Presentation APIs
☆20Jun 15, 2026Updated last month
moxious / xml2neo
View on GitHub
Tools for converting/loading XML into neo4j
☆11Nov 24, 2018Updated 7 years ago
LanguageMachines / PICCL
View on GitHub
A set of workflows for corpus building through OCR, post-correction and normalisation
☆50Sep 7, 2022Updated 3 years ago
EuropeanaNewspapers / ner-app
View on GitHub
Named Entity Recognition tool for Europeana Newspapers
☆14Apr 5, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
brendano / OConnor_IREvents_ACL2013
View on GitHub
Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…
☆27Dec 14, 2020Updated 5 years ago
nikolamilosevic86 / Marvin
View on GitHub
Semantic text annotation tools using Wordnet and DBPedia
☆14Dec 14, 2017Updated 8 years ago
ryanfb / ancientgreekocr-ocr-evaluation-tools
View on GitHub
'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
☆23Feb 21, 2018Updated 8 years ago
mittagessen / curt
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
cisocrgroup / Resources
View on GitHub
Manuals, lexica, OCR test data for PoCoTo and the profiler
☆15Jul 2, 2021Updated 5 years ago
CLARIAH / qber
View on GitHub
Crowd Based Coding and Harmonization using Linked Data
☆11Jan 22, 2018Updated 8 years ago
se4u / mvlsa
View on GitHub
Multiview LSA
☆11Jun 22, 2015Updated 11 years ago
glenrobson / iiif_stuff
View on GitHub
IIIF Examples and useful code
☆20Sep 10, 2025Updated 10 months ago
stefanklut / laypa
View on GitHub
Layout analysis to find layout elements in documents (similar to P2PaLA)
☆22May 20, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Pleias / marginalia
View on GitHub
☆67Mar 4, 2024Updated 2 years ago
cygri / vocidex
View on GitHub
Search over RDF schemas and OWL ontologies
☆11Sep 28, 2013Updated 12 years ago
pks / rebol
View on GitHub
Grounding statistical machine translation with semantic parsing
☆14May 13, 2015Updated 11 years ago
delph-in / zhong
View on GitHub
The zhong [|] Chinese grammars
☆15Mar 13, 2026Updated 4 months ago
semanticize / semanticizest
View on GitHub
Standalone Semanticizer
☆32Mar 4, 2015Updated 11 years ago
leondz / entity_recognition
View on GitHub
framework for doing NER and other types of entity recognition, in Python
☆68Jun 21, 2022Updated 4 years ago
emijrp / wikidata
View on GitHub
Scripts for Wikidata
☆21Jul 3, 2026Updated 3 weeks ago
jbhoward-dublin / iiif-imageManipulation
View on GitHub
Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer
☆18Jul 31, 2017Updated 8 years ago
quadrismegistus / lltk
View on GitHub
Literary Language Toolkit: code, models, corpora, and web tools
☆11Jul 5, 2026Updated 2 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
alltom / propagators
View on GitHub
JavaScript implementation of Radul and Sussman's Propagator model
☆15Jan 8, 2014Updated 12 years ago
andersjo / framenet-annotation
View on GitHub
Browser-based annotation tool for Framenet
☆16Jan 27, 2015Updated 11 years ago
KamalaSowmya / DiscussionSummarization
View on GitHub
Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…
☆12Apr 10, 2014Updated 12 years ago
jze / ocropus-model_fraktur
View on GitHub
OCRopus model for Gothic print (Fraktur)
☆19Feb 16, 2020Updated 6 years ago
sherlok / sherlok
View on GitHub
Distributed restful text mining.
☆23Jan 19, 2016Updated 10 years ago
versotym / rhymetagger
View on GitHub
A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…
☆34Jun 25, 2025Updated last year
allenai / unifew
View on GitHub
Unifew: Unified Fewshot Learning Model
☆18Sep 10, 2021Updated 4 years ago