Literary Language Toolkit: code, models, corpora, and web tools
☆11Mar 28, 2024Updated last year
Alternatives and similar repositories for lltk
Users that are interested in lltk are comparing it to the libraries listed below
Sorting:
- ☆27Feb 2, 2021Updated 5 years ago
- A simple vector space model based tool for sentiment analysis of literary texts☆18Sep 17, 2024Updated last year
- Using machine learning to classify book reviews based on genre☆11Dec 5, 2017Updated 8 years ago
- A context-based spellchecker for correcting OCR output.☆21Feb 3, 2023Updated 3 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆23Aug 29, 2022Updated 3 years ago
- Multilingual Open Text☆25May 8, 2025Updated 9 months ago
- Extraction of structured and unstructured information from fandom.com pages☆27Feb 22, 2025Updated last year
- German Parliamentary Corpus (GerParCor)☆30Jan 14, 2026Updated last month
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Mar 21, 2022Updated 3 years ago
- Neural Language Models for Historical Research☆29Oct 16, 2024Updated last year
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c…☆33Oct 26, 2018Updated 7 years ago
- ☆36Jul 7, 2025Updated 7 months ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆40Nov 13, 2025Updated 3 months ago
- European Parliament website Python scraper☆12Oct 19, 2016Updated 9 years ago
- https://sites.google.com/site/multidimensionaltagger☆38Dec 6, 2023Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 3 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- ☆11Jun 18, 2023Updated 2 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- ☆14Feb 19, 2024Updated 2 years ago
- Dutch abusive language data☆11Sep 23, 2023Updated 2 years ago
- MG top-down beam parsing☆13Jul 2, 2018Updated 7 years ago
- Shell script to manage multiple Microsoft Teams profiles on Linux.☆12Mar 3, 2021Updated 5 years ago
- ☆12Dec 14, 2022Updated 3 years ago
- The Swiss Court Ruling Corpus (SCRC) contains code for extracting information from Swiss court rulings☆11Jan 22, 2025Updated last year
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).☆11Feb 10, 2023Updated 3 years ago
- Make a searchable pdf via Google Cloud Vision OCR☆14Jan 17, 2020Updated 6 years ago
- ☆12Jan 31, 2015Updated 11 years ago
- Word embeddings trained on medical subreddits.☆10Jan 4, 2021Updated 5 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆13Jul 1, 2024Updated last year
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Within-book topic modeling on HTRC feature extraction files☆23May 3, 2016Updated 9 years ago
- ☆15Aug 19, 2024Updated last year
- This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…☆10May 9, 2024Updated last year
- Python client library for the ClamAV antivirus.☆12May 15, 2025Updated 9 months ago
- A Python helper library to convert between ISO 639 two- and three-letter codes.☆11Nov 13, 2024Updated last year
- ☆11Jul 12, 2021Updated 4 years ago
- Dataset and pre-trained model of EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."☆10Jan 26, 2020Updated 6 years ago