A lexical normalizer for historical spelling variants using a transformer architecture.
☆10 Mar 12, 2025 Updated 11 months ago
Alternatives and similar repositories for transnormer
Users who are interested in transnormer are comparing it to the libraries listed below.
- SFST/SMOR/DWDS-based German Morphology ☆21 Feb 2, 2026 Updated last month
- Multi Tier Annotation Search ☆12 May 13, 2024 Updated last year
- Transcriptions of reading primers (19th century) ☆11 Oct 31, 2025 Updated 4 months ago
- NLP-helper for OCR-ed pages in PAGE XML format ☆10 Dec 6, 2024 Updated last year
- Translation of query languages to serialized KoralQuery protocol ☆13 Feb 23, 2026 Updated last week
- Precise hotword listener on Tract and Rust ☆12 Aug 6, 2022 Updated 3 years ago
- ☆10 Oct 2, 2021 Updated 4 years ago
- Zap file format compatible with a future version of Bleve ☆13 Updated this week
- An Easy Annotation Tool for Natural Language Processing ☆11 May 17, 2024 Updated last year
- 🇩🇪 Preprocess German texts to do some serious natural-language processing. ☆12 Dec 9, 2022 Updated 3 years ago
- A tiny graph database engine written in C ☆10 May 9, 2014 Updated 11 years ago
- Static Huffman coding ☆10 Apr 3, 2017 Updated 8 years ago
- ☆11 Feb 13, 2026 Updated 2 weeks ago
- Simple cross-process mutex based on file locks ☆10 Sep 14, 2017 Updated 8 years ago
- shoco is a compressor for small text strings. [Not maintained]. ☆10 Sep 4, 2019 Updated 6 years ago
- ☆13 Apr 14, 2024 Updated last year
- A very simple web UI for Mycroft ☆10 Aug 6, 2021 Updated 4 years ago
- Check your modified Ground Truth files with visual support! ☆10 Jan 31, 2024 Updated 2 years ago
- Collection of Python Scripts that Allow Open Web UI to Interact with External APIs ☆12 Apr 4, 2025 Updated 10 months ago
- Reddit search tool using the pushshift.io API ☆14 Sep 17, 2024 Updated last year
- Experimental text shaping in LuaTeX using Harfbuzz library ☆10 Jul 17, 2018 Updated 7 years ago
- ☆13 Oct 2, 2011 Updated 14 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German ☆12 Feb 27, 2023 Updated 3 years ago
- Commentary and notes from a 42-year-old bar owner trying to understand all this. Newcomers start here. ☆12 Sep 3, 2022 Updated 3 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation… ☆14 Dec 6, 2025 Updated 2 months ago
- Compressed double-array tries for static string dictionaries ☆11 May 9, 2019 Updated 6 years ago
- TensorFlow Lite example on a Raspberry Pi Zero W ☆11 Dec 16, 2020 Updated 5 years ago
- This repository contains the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handw… ☆20 Jan 12, 2026 Updated last month
- Telegram bot for sending files to chat or channel by cron. ☆12 Feb 27, 2018 Updated 8 years ago
- Harfbuzz bindings for Lua ☆12 Dec 9, 2025 Updated 2 months ago
- FairCopy is a word processor for the humanities scholar. ☆13 Jan 26, 2026 Updated last month
- Mojolicious plugin to make it a little easier to implement an OAuth2 authorization/resource server ☆11 May 20, 2025 Updated 9 months ago
- Distributed KV store using go-ds-crdt and libp2p ☆12 Nov 28, 2021 Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Dec 17, 2021 Updated 4 years ago
- Multi-arch graphs1090 container for visualising ADSB reception stats (amd64, arm/v6, arm/v7, arm64) ☆12 Jan 9, 2021 Updated 5 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 Oct 18, 2024 Updated last year
- Generate a list of the top 25 value stocks on the market. ☆14 May 16, 2017 Updated 8 years ago
- A new lossless data compression algorithm ☆12 Nov 19, 2025 Updated 3 months ago
- Lua implementation of the Unicode Bidirectional Algorithm ☆10 Jul 27, 2017 Updated 8 years ago