A lexical normalizer for historical spelling variants using a transformer architecture.
☆10Mar 12, 2025Updated last year
Alternatives and similar repositories for transnormer
Users that are interested in transnormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SFST/SMOR/DWDS-based German Morphology☆21Updated this week
- Multi Tier Annotation Search☆12May 13, 2024Updated last year
- FairCopy is a word processor for the humanities scholar.☆13Jan 26, 2026Updated 2 months ago
- This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handw…☆21Jan 12, 2026Updated 3 months ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 🇩🇪 Preprocess German texts to do some serious natural-language processing.☆12Dec 9, 2022Updated 3 years ago
- Web Content Extraction Benchmark☆23Dec 16, 2025Updated 3 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Nov 18, 2024Updated last year
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- DM is an environment for the study and annotation of images and texts. It is a suite of tools, enabling scholars to gather and organize t…☆19Dec 10, 2018Updated 7 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆15Dec 6, 2025Updated 4 months ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SapiMouse - a new dataset for Mouse Dynamics☆21Dec 28, 2022Updated 3 years ago
- CERberus -- guardian against character errors☆29Feb 15, 2024Updated 2 years ago
- A documentation for FAIR GPT, a virtual RDM consultant☆15Oct 10, 2024Updated last year
- QualiAnon is a tool to support the anonymization of text data. It is developed by the Qualiservice research data center for the anonymiza…☆35Feb 16, 2026Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.