FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for proces…
☆65Dec 9, 2025Updated 3 months ago
Alternatives and similar repositories for folia
Users that are interested in folia are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Jan 24, 2025Updated last year
- Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA)☆23Feb 11, 2022Updated 4 years ago
- Multi Tier Annotation Search☆26May 12, 2021Updated 4 years ago
- Collaborative Synchronized Corpus Annotation Tool☆11Dec 31, 2018Updated 7 years ago
- Co-reference resolution for the English language.☆17Jan 12, 2015Updated 11 years ago
- Events and Situations Ontology☆14Apr 20, 2018Updated 7 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Nov 18, 2024Updated last year
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆69Sep 11, 2023Updated 2 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- ☆11Feb 13, 2026Updated last month
- Named Entity Recognition tool for Europeana Newspapers☆14Apr 5, 2018Updated 7 years ago
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…☆27Dec 14, 2020Updated 5 years ago
- Crowd Based Coding and Harmonization using Linked Data☆11Jan 22, 2018Updated 8 years ago
- Multiview LSA☆11Jun 22, 2015Updated 10 years ago
- Citar part of speech tagger☆39Mar 28, 2016Updated 9 years ago
- Design patterns for the ontology-lexicon interface using lemon and OWL☆21Jul 27, 2018Updated 7 years ago
- Data and preprocessing scripts for SemEval 2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding☆14Feb 3, 2022Updated 4 years ago
- ☆67Mar 4, 2024Updated 2 years ago
- A web-based programming environment for educational robotics that supports live coding and autonomy using a hybrid blocks/text programmin…☆20Jan 13, 2025Updated last year
- An XQuery 3.0 library for defining algebraic data types, and performing structural pattern matching on them.☆17Jun 30, 2012Updated 13 years ago
- Grounding statistical machine translation with semantic parsing☆14May 13, 2015Updated 10 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Jun 21, 2022Updated 3 years ago
- OCR-D wrapper for detectron2 based segmentation models☆17May 1, 2025Updated 10 months ago
- Standalone Semanticizer☆32Mar 4, 2015Updated 11 years ago
- The zhong [|] Chinese grammars☆15Mar 13, 2026Updated last week
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- Browser-based annotation tool for Framenet☆16Jan 27, 2015Updated 11 years ago
- A PyData 2013 talk on straightforward, data-driven ways to handle natural language text in Python.☆51Oct 23, 2014Updated 11 years ago
- Distributed restful text mining.☆21Jan 19, 2016Updated 10 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆32Jun 25, 2025Updated 8 months ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Aug 3, 2011Updated 14 years ago
- Unifew: Unified Fewshot Learning Model☆18Sep 10, 2021Updated 4 years ago
- XSLT Functions for Transpect☆13Mar 16, 2026Updated last week
- A proofreading tool using Google's N-gram corpus.☆12Sep 2, 2022Updated 3 years ago
- The DDI Discovery Vocabulary, an RDF vocabulary for data description and discovery based on DDI☆25May 5, 2023Updated 2 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆135Mar 12, 2026Updated last week
- PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files.☆17Sep 4, 2012Updated 13 years ago