The Hanover Tagger - A simple approach to lemmatization and POS-tagging of German morphology based on heuristics and hidden markov models
☆56Mar 21, 2025Updated 11 months ago
Alternatives and similar repositories for HanTa
Users that are interested in HanTa are comparing it to the libraries listed below
Sorting:
- Python port for IWNLP.Lemmatizer☆18Oct 18, 2023Updated 2 years ago
- This repository contains all manually labeled data from the GermEval-2018 shared task.☆29Sep 28, 2018Updated 7 years ago
- GermaParl R Data Package☆14Aug 31, 2022Updated 3 years ago
- Open Discourse is the first fully comprehensive corpus of the plenary proceedings of the federal German Parliament (Bundestag).☆105Feb 12, 2025Updated last year
- IWNLP: A parser for the German edition of Wiktionary☆13Jul 28, 2023Updated 2 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Dec 11, 2020Updated 5 years ago
- German lemmatization with IWNLP as extension for spaCy☆26Jul 28, 2023Updated 2 years ago
- 📑 Python Package to reconstruct the original continuous text from PDFs with language models☆32Sep 8, 2023Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆518Oct 30, 2024Updated last year
- ☆35Dec 26, 2022Updated 3 years ago
- A Streamlit component for annotating text by text selecting.☆41Jun 12, 2024Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆39Mar 8, 2022Updated 3 years ago
- Linked Open Data Development at the Canadian Heritage Information Network - Développement en données ouvertes et liées au Réseau canadien…☆12Jul 17, 2024Updated last year
- Repository for the course "JavaScript Object Oriented Programming"☆11Jun 30, 2019Updated 6 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- Areal images sourced from the FIS-Broker, City of Berlin.☆13Nov 10, 2025Updated 3 months ago
- Data profiling tools for Big Data☆11Nov 17, 2025Updated 3 months ago
- ☆11Jan 27, 2026Updated last month
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 7 months ago
- Implements Global Word Vectors.☆11Feb 8, 2020Updated 6 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11May 19, 2022Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆186Jun 6, 2025Updated 9 months ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 3 months ago
- 青空文庫のテキストファイル☆14Feb 4, 2024Updated 2 years ago
- Simulating Imperial Dynamics and Conflict in the Ancient World☆11Oct 3, 2021Updated 4 years ago
- 基于 Slidev 的 Dify 平台插件,可以将 Markdown 内容一键转换为 PPT 演示文稿。☆14Jul 15, 2025Updated 7 months ago
- Sentiment Analysis in Japanese. sentiment_ja with JavaScript☆10Apr 1, 2022Updated 3 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- List of people, organisations, groups, … doing datavis in Berlin☆11Feb 20, 2026Updated 2 weeks ago
- Collect, discuss and manage feedback on OntoME☆12Dec 7, 2023Updated 2 years ago
- Crawler that collects and extracts content of daily published news articles☆12Feb 18, 2023Updated 3 years ago
- Demo for the Speakeasy Node.js package.☆11Jan 27, 2016Updated 10 years ago
- Alternative and lite implementation of Hoogle☆11Apr 9, 2024Updated last year
- 🔎 Finds fuzzy matches between datasets☆16Jan 26, 2026Updated last month
- 🌸De-inflect Japanese words☆15Nov 24, 2025Updated 3 months ago
- Javascript toplevel worker☆14Feb 5, 2026Updated last month
- ☆20Jun 25, 2013Updated 12 years ago
- thread-local storage for OCaml☆17Jan 13, 2025Updated last year
- A NativeScript Range Seek Bar widget.☆10Jan 14, 2023Updated 3 years ago