multilingual-dh / nlp-resources
Natural language processing resources for multiple languages, with an eye towards use for digital humanities.
☆124Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for nlp-resources
- Digital Humanities Across Borders☆46Updated 8 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- High-performance text aligner for large collections of texts☆45Updated 3 weeks ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Detect and align similar passages☆88Updated 2 months ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c…☆30Updated 6 years ago
- A collection of Jupyter notebooks in many human and computer languages for doing digital humanities. PRs welcome!☆124Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆77Updated 9 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆39Updated 2 years ago
- Python 3 library for processing historical English☆64Updated 3 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆12Updated 2 years ago
- ☆28Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆27Updated 5 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆36Updated 2 years ago
- Practical Approaches to Data Science with Text☆38Updated 4 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆38Updated 5 years ago
- Data for the HIPE 2022 shared task.☆16Updated 11 months ago
- Project on the history of genre.☆22Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆62Updated 3 weeks ago
- Text collections made available by the CLiGS group.☆22Updated 2 years ago
- I.PHI dataset generation☆25Updated 11 months ago
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- A simple toolkit for conducting analyses using corpus methods☆24Updated 3 years ago
- A command-line program to download text corpora.☆33Updated 7 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆10Updated 2 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 6 months ago