coastalcph / histnormView external linksLinks
Compiled tools, datasets, and other resources for historical text normalization.
☆20Jun 18, 2019Updated 6 years ago
Alternatives and similar repositories for histnorm
Users that are interested in histnorm are comparing it to the libraries listed below
Sorting:
- A tool for automatic spelling normalization☆21Jan 18, 2021Updated 5 years ago
- Digitale Geisteswissenschaften rund um Graphentechnologien☆10Updated this week
- ☆15Aug 14, 2018Updated 7 years ago
- A tool for text normalisation via character-level machine translation☆13Jun 12, 2020Updated 5 years ago
- Data for the HIPE 2022 shared task.☆21Nov 29, 2023Updated 2 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- Finite-state script normalization and processing utilities☆46Jan 14, 2026Updated last month
- ☆32Sep 27, 2021Updated 4 years ago
- Libraries, Archives and Museums (LAM)☆88Oct 4, 2022Updated 3 years ago
- ☆10Feb 2, 2021Updated 5 years ago
- A parallel evaluation data set of SAP software documentation with document structure annotation☆14Jul 30, 2025Updated 6 months ago
- Modified Editorial website template. Based on Editorial theme in html5up.net, adapted for Jekyll by Andrew Bancich.☆11Jun 10, 2024Updated last year
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- German GPT-2 model☆32Aug 17, 2021Updated 4 years ago
- Python source code for EMNLP 2020 paper "Reusing a Pretrained Language Model on Languages with Limited Corpora for Unsupervised NMT".☆35Mar 16, 2022Updated 3 years ago
- Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.☆11Aug 14, 2019Updated 6 years ago
- Creating crowdsourcing based experiments made easy☆10May 25, 2020Updated 5 years ago
- Canonical normalizing flows☆10Apr 30, 2019Updated 6 years ago
- MATLAB code for Stein Point Markov Chain Monte Carlo.☆13Jul 3, 2019Updated 6 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- PowerShell scripts for processing content into CONTENTdm load packages, batch editing, and batch re-ocr.☆11Jun 2, 2023Updated 2 years ago
- Linear Attention for Efficient Bidirectional Sequence Modeling☆15May 13, 2025Updated 9 months ago
- Identifying Nuances in Fake News vs. Satire: Using Semantic and Linguistic Cues (NLP4IF, EMNLP-IJCNLP 2019)☆11Dec 21, 2020Updated 5 years ago
- Linguistic Reconstruction with LingPy☆15Aug 5, 2024Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆40Dec 12, 2025Updated 2 months ago
- ☆52Aug 18, 2024Updated last year
- ☆11Mar 25, 2024Updated last year
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Jul 11, 2023Updated 2 years ago
- f("A1") = 𓀀; also A1.png☆12Jul 19, 2025Updated 6 months ago
- Code for our paper Re-balancing Variational Autoencoder Loss for Molecule Sequence Generation.☆11Sep 4, 2022Updated 3 years ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 8 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Feb 4, 2025Updated last year
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- ☆10Sep 13, 2022Updated 3 years ago
- ☆13Nov 28, 2025Updated 2 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago