emanjavacas/pie

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/emanjavacas/pie)

emanjavacas / pie

A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.

☆25

Alternatives and similar repositories for pie

Users that are interested in pie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

avjves / textreuse-blast
View on GitHub
A software to detect text reuse with BLAST.
☆13Oct 8, 2019Updated 6 years ago
hipster-philology / nlp-pie-taggers
View on GitHub
Extension for pie to include taggers with their models and pre/postprocessors
☆11Jun 23, 2026Updated last month
hipster-philology / pyrrha
View on GitHub
A language-independent post-correction app for POS-tagging and lemmatization
☆30Jun 17, 2026Updated last month
mikekestemont / copia
View on GitHub
Bias correction for richness in abundance data
☆13Apr 20, 2026Updated 3 months ago
SegmOnto / Guidelines
View on GitHub
☆14Apr 19, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
cneud / alto-tools
View on GitHub
Python tools for performing various operations on ALTO XML files
☆50Jun 12, 2026Updated last month
coastalcph / histnorm
View on GitHub
Compiled tools, datasets, and other resources for historical text normalization.
☆21Jun 18, 2019Updated 7 years ago
cohure / CoHuRe
View on GitHub
☆27Feb 2, 2021Updated 5 years ago
NathanGodey / headless-lm
View on GitHub
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆29Apr 17, 2024Updated 2 years ago
HTR-United / cremma-medieval
View on GitHub
Transcription corpora for training HTR models for medieval manuscripts from the 12th to the 15th century.
☆25Jan 17, 2025Updated last year
ilmucio / vim-machine
View on GitHub
A repo to share idea on customize a machine for the use of vim.
☆13May 7, 2020Updated 6 years ago
evt-project / evt-builder
View on GitHub
Edition Visualization Technology 1 - XSLT Builder
☆15Oct 23, 2024Updated last year
riedlma / sequence_tagging
View on GitHub
Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German
☆26May 10, 2021Updated 5 years ago
emanjavacas / cosycat
View on GitHub
Collaborative Synchronized Corpus Annotation Tool
☆10Dec 31, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
CIRCSE / LEMLAT3
View on GitHub
Morphological analyzer and lemmatizer for Latin.
☆29May 22, 2026Updated 2 months ago
KorAP / Tokenizer-Evaluation
View on GitHub
Benchmark scripts for comparing different tokenizers and sentence segmenters of German
☆12Feb 27, 2023Updated 3 years ago
cjbayron / artist2lyrics
View on GitHub
Lyrics crawling, pre-processing, embedding generation, model training, and lyrics generation - all in one tool
☆14Nov 4, 2018Updated 7 years ago
UniversalDependencies / UD_German-HDT
View on GitHub
☆14May 29, 2026Updated last month
ComplexNetTSP / MultilayerParis
View on GitHub
Paris multilayer transport network
☆11Sep 13, 2021Updated 4 years ago
omni-us / pagexml
View on GitHub
Library in C++ and a python wrapper for dealing with Page XML files
☆13Apr 25, 2025Updated last year
pharos-alexandria / ocr-greek_cursive
View on GitHub
Training files for Greek cursive script (in early print)
☆15May 26, 2021Updated 5 years ago
dhfbk / Histo
View on GitHub
☆15Jan 9, 2019Updated 7 years ago
dbmdz / historic-ner
View on GitHub
Repository for "Towards Robust Named Entity Recognition for Historic German"
☆18Dec 11, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
noi-techpark / stuart-chatbot
View on GitHub
Stuart is simple RAG System, that the Open Data Hub uses as a chatbot to help the Open Data Hub customer care team in solving tickets. St…
☆14May 18, 2026Updated 2 months ago
impresso / named-entity-tutorial-dh2019
View on GitHub
Tutorial on NE processing for Digital Humanities - DH Utrech 2019
☆24Jul 18, 2019Updated 7 years ago
fbkarsdorp / alignment
View on GitHub
Simple Python library for doing (multiple) sequence alignment
☆17Jun 24, 2018Updated 8 years ago
GarfieldLyu / OCR_POST_DE
View on GitHub
OCR post correction for old German corpus
☆20Aug 29, 2022Updated 3 years ago
altoxml / schema
View on GitHub
ALTO XML schema - latest and all former versions
☆55Jul 8, 2026Updated 2 weeks ago
SunoikisisDC / SunoikisisDC-2016-2017
View on GitHub
Planning Seminar and 2016-2017 WS and SS Courses
☆10Mar 20, 2019Updated 7 years ago
clarinsi / csmtiser
View on GitHub
A tool for text normalisation via character-level machine translation
☆13Jun 12, 2020Updated 6 years ago
dbamman / latin-bert
View on GitHub
Latin BERT
☆75Jun 27, 2024Updated 2 years ago
lattice-8094 / fr-litbank
View on GitHub
A french litbank corpus
☆10Jan 22, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
melaniewalsh / AI-4-Humanists
View on GitHub
The GitHub repository for the AI for Humanists Project
☆21Jun 9, 2025Updated last year
proycon / python-ucto
View on GitHub
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…
☆32Feb 2, 2026Updated 5 months ago
dasmiq / passim
View on GitHub
Detect and align similar passages
☆122Apr 27, 2026Updated 2 months ago
dcthree / antigrapheus
View on GitHub
In-browser OCR of Ancient Greek and Latin
☆27Updated this week
marijnkoolen / fuzzy-search
View on GitHub
Fuzzy search modules for searching lists of words in low quality OCR and HTR text.
☆23Jun 29, 2026Updated 3 weeks ago
gz / phdplot
View on GitHub
Make nice plots with matplotlib.
☆11Oct 8, 2019Updated 6 years ago
ryanfb / ancientgreekocr-ocr-evaluation-tools
View on GitHub
'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
☆23Feb 21, 2018Updated 8 years ago