ieg-dhr/NLP-Course4Humanities_2024

This repository is part of an NLP course for humanities and cultural studies. This course uses historical newspapers as a source and applies NLP methods to them. NLP tasks: Tokenization, Lemmatization, TF-IDF, Part-of-speech tagging, semantic search with transformers, article extraction and OCR post-correction with LLMs, NER and text classificat…

☆20

Alternatives and similar repositories for NLP-Course4Humanities_2024

Users who are interested in NLP-Course4Humanities_2024 are comparing it to the libraries listed below.

Pleias / OCRoscope
Small python package to measure OCR quality and other related metrics.
☆26 · Feb 19, 2024 · Updated 2 years ago
qurator-spk / sbb_ner
Named Entity Recognition
☆19 · Feb 13, 2026 · Updated 5 months ago
acdh-oeaw / vocabseditor
Vocabseditor is a web-based tool for collaborative work on controlled vocabularies development
☆24 · Sep 4, 2025 · Updated 10 months ago
KRR-Oxford / HierarchyTransformers
Language Models as Hierarchy Encoders
☆43 · Jan 6, 2026 · Updated 6 months ago
mapping-commons / rda-fair-mappings
Managing the progress for the RDA Working Group on Fair Mappings (https://www.rd-alliance.org/groups/fair-mappings-wg/).
☆11 · Jun 9, 2026 · Updated last month
doriantaylor / rb-rdf-lmdb
Symas (OpenLDAP) LMDB back-end for RDF::Repository
☆17 · Jun 29, 2026 · Updated 3 weeks ago
arthurxlw / cytonMt
CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++
☆21 · Oct 28, 2018 · Updated 7 years ago
bltlab / mot
Multilingual Open Text
☆26 · May 8, 2025 · Updated last year
eyereasoner / eyeling
A Notation3 (N3) reasoner in JavaScript.
☆18 · Updated this week
LinkedPasts / linked-traces-format
Patterns based on the W3C Web Annotation Model, primarily for use in linking resources describing historical phenomena with the places re…
☆16 · Mar 6, 2020 · Updated 6 years ago
Knowledgator / GLiNER.js
GLiNER inference in JavaScript
☆27 · Mar 2, 2025 · Updated last year
UriSha / EmbeddinglessNMT
The implementation of "Neural Machine Translation without Embeddings", NAACL 2021
☆33 · Jun 9, 2021 · Updated 5 years ago
TheScienceMuseum / heritage-connector
Heritage Connector: Transforming text into data to extract meaning and make connections
☆27 · Feb 14, 2023 · Updated 3 years ago
namisan / exdeep-nmt
☆32 · Sep 27, 2021 · Updated 4 years ago
texttechnologylab / GerParCor
German Parliamentary Corpus (GerParCor)
☆32 · Mar 29, 2026 · Updated 3 months ago
gbv / bartoc.org
Source code of BARTOC.org user interface
☆29 · Jul 13, 2026 · Updated last week
FAIRDataTeam / OpenRefine-metadata-extension
Extension for OpenRefine to support FAIR metadata
☆25 · Dec 6, 2022 · Updated 3 years ago
maxdotio / mighty-batch
Highly concurrent and fast content processing for Mighty Inference Server
☆10 · Feb 6, 2023 · Updated 3 years ago
Yuanhy1997 / HyPe
HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]
☆14 · Jul 11, 2023 · Updated 3 years ago
skohub-io / skohub-pages
☆21 · Feb 10, 2026 · Updated 5 months ago
CharizardAcademy / convtransformer
Code for the ACL2020 paper Character-Level Translation with Self-Attention
☆31 · Oct 15, 2020 · Updated 5 years ago
philschmid / multilingual-serverless-qa-aws-lambda
☆10 · Dec 17, 2020 · Updated 5 years ago
TEI-Boilerplate / TEI-Boilerplate
☆92 · Dec 12, 2023 · Updated 2 years ago
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12 · Mar 18, 2023 · Updated 3 years ago
ottowg / gsap-ner
☆10 · Oct 2, 2024 · Updated last year
cisnlp / mPLM-Sim
mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models
☆11 · Jan 19, 2024 · Updated 2 years ago
stefan-it / ukrainian-electra
Ukrainian ELECTRA model
☆12 · Mar 11, 2023 · Updated 3 years ago
google-research / nisaba
Finite-state script normalization and processing utilities
☆52 · Jun 24, 2026 · Updated 3 weeks ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15 · Sep 12, 2025 · Updated 10 months ago
honnibal / py-clearnlp-converter
A simple Python wrapper for the ClearNLP constituents-to-dependencies converter
☆11 · Nov 2, 2015 · Updated 10 years ago
WangFei-2019 / SNARE
Project for SNARE benchmark
☆11 · Jun 5, 2024 · Updated 2 years ago
flairNLP / familiarity
Label shift estimation for transfer difficulty with Familiarity.
☆10 · Feb 4, 2025 · Updated last year
DjangoPeng / leetcode-solutions
The Python solutions of leetcode
☆14 · Apr 26, 2020 · Updated 6 years ago
bicici / FDA
Feature Decay Algorithms
☆11 · Mar 5, 2014 · Updated 12 years ago
sparna-git / xls2rdf
Create RDF data from Excel spreadsheets - edit SKOS vocabularies, knowledge graph instances, SHACL constraints, OWL ontologies in Excel f…
☆29 · Jul 3, 2026 · Updated 2 weeks ago
XinbangZhang / DATA-NAS
Codes for DATA: Differentiable ArchiTecture Approximation.
☆11 · Jul 22, 2021 · Updated 5 years ago
quadrismegistus / lltk
Literary Language Toolkit: code, models, corpora, and web tools
☆11 · Jul 5, 2026 · Updated 2 weeks ago
hannahxchen / automatic-paraphrase-dataset-augmentation
Code and data for automatic paraphrase dataset augmentation.
☆11 · Mar 8, 2021 · Updated 5 years ago
huggingface / spm_precompiled
Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers`
☆22 · Jun 9, 2026 · Updated last month