An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic annotation finding application in Natural Language Processing (NLP). This library was formerly part of PyNLPl.
☆18Nov 18, 2024Updated last year
Alternatives and similar repositories for foliapy
Users that are interested in foliapy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- FoLiA library for C++☆17Mar 25, 2026Updated last month
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Sep 7, 2022Updated 3 years ago
- Text-Induced Corpus Clean-up☆20Jun 20, 2023Updated 2 years ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆69Sep 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆70Mar 25, 2026Updated last month
- A lexical normalizer for historical spelling variants using a transformer architecture.☆10Mar 12, 2025Updated last year
- Guidelines for software quality & sustainability (CLARIAH WP2 task 54.100)☆18May 29, 2022Updated 3 years ago
- Multi Tier Annotation Search☆12May 13, 2024Updated last year
- An NLP-suite powered by deep learning☆19Mar 24, 2023Updated 3 years ago
- SFST/SMOR/DWDS-based German Morphology☆21Updated this week
- DM is an environment for the study and annotation of images and texts. It is a suite of tools, enabling scholars to gather and organize t…☆19Dec 10, 2018Updated 7 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Dec 9, 2025Updated 4 months ago
- Performs pairwise preference ranking for a given trainfile and testfile with binary class labels (1 and not 1). The binary classification…☆14Jul 12, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A tool for the comparative visualization of geospatial-temporal data.☆22Mar 21, 2014Updated 12 years ago
- validates data imports in analysis pipeline☆10Jan 19, 2023Updated 3 years ago
- ☆21Sep 11, 2017Updated 8 years ago
- TiMBL implements several memory-based learning algorithms.☆54Mar 12, 2026Updated last month
- Fuzzy search modules for searching lists of words in low quality OCR and HTR text.☆23Mar 30, 2026Updated last month
- An exploration on natural language processing of reddit comments☆10Nov 8, 2017Updated 8 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Simple spaCy-based concept extraction API, involving a dictionary of relevant concepts.☆10May 15, 2019Updated 6 years ago
- ☆27Feb 2, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Django app to import and categorise financial transactions☆13Jun 10, 2021Updated 4 years ago
- convert org work clock entries to timeclock.el entries☆13Mar 19, 2026Updated last month
- Reproducible research paper in the journal Archaeology in Oceania☆16Jan 18, 2012Updated 14 years ago
- Recoll queries using consult☆15Apr 6, 2025Updated last year
- an approximate string matching or fuzzy-matching system for spelling correction, normalisation or post-OCR correction (mirror of https://…☆39Feb 10, 2026Updated 2 months ago
- GNU Guix package definitions for proprietary software, or software with unclear licenses.☆12Feb 20, 2025Updated last year
- Content-based search for Elfeed.☆14Oct 4, 2025Updated 6 months ago
- Succeeded by syntaxdot-transformers: https://github.com/tensordot/syntaxdot/tree/main/syntaxdot-transformers☆19Oct 7, 2020Updated 5 years ago
- ONNX runtime for Emacs Lisp☆14Aug 23, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆26Nov 27, 2021Updated 4 years ago
- My Arch Linux setup for a lean, secure, command-line driven development environment with modular configuration management using shell scr…☆10Jan 22, 2023Updated 3 years ago
- A draughts (checkers) library for Python with move generation, PDN reading and writing, engine communication and balloted openings☆20Jan 12, 2025Updated last year
- a major mode for emacs for editing n3 and turtle RDF☆14Dec 13, 2017Updated 8 years ago
- Org babel extension to apply kubectl to org babel source blocks.☆15Feb 13, 2020Updated 6 years ago
- An HP ILO Prometheus Exporter☆13Nov 16, 2022Updated 3 years ago
- Mirror of https://git.tecosaur.net/tec/org-music☆11Aug 22, 2022Updated 3 years ago