hunspell / zsort
sort lines of Hungarian text files using Hunspell morphological analysis with Magyar Ispell 1.7 language data, fixing known problems of collate algorithms of glibc and ICU/Unicode CLDR
☆16Updated 4 years ago
Alternatives and similar repositories for zsort:
Users that are interested in zsort are comparing it to the libraries listed below
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated this week
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆25Updated 6 months ago
- User contributed (non Google) OCR models for Tesseract☆24Updated 3 months ago
- Multi Tier Annotation Search☆12Updated 8 months ago
- Generate language n-gram statistics☆18Updated 2 years ago
- A tool to extract canonical references from text.☆20Updated 3 years ago
- Mirror of https://gerrit.wikimedia.org/g/purtle See https://www.mediawiki.org/wiki/Developer_access for contributing)☆10Updated 2 weeks ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆13Updated 2 months ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 4 years ago
- ⚽ Reliable message passing in distributed systems.☆14Updated last year
- libyui-ncurses☆19Updated 3 years ago
- Git mirror of sdbm source code ⛺☆26Updated 14 years ago
- PurePos is an open source hybrid morphological tagger.☆16Updated 4 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Updated 3 years ago
- The Free Lossless Audio Codec (FLAC) Specification.☆35Updated last year
- A small, simple hash table written in C.☆23Updated 13 years ago
- Lossless PDF squeezer☆23Updated 10 months ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Updated 5 months ago
- Tools to analyze web archives☆20Updated 8 years ago
- Use Go and WASM to create the basic triangle WebGL☆13Updated last year
- Samples for AV1 Video Codec☆10Updated 5 years ago
- A mini LDP Server written in Go.☆11Updated 8 years ago
- A statically typed binary tree in Go without casts or reflection☆19Updated 11 years ago
- The API is for anyone who wants to adopt best practices for a translation services API to interact with counterparts directly from your a…☆12Updated 8 years ago
- Yet Another Efficient Unification Algorithm☆26Updated 5 months ago
- Markdown for Linked Data☆16Updated 9 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated this week
- A web-based application to assist in geocoding of (mainly historical) datasets.☆9Updated last year
- Dependency manager for the Chaos language☆21Updated 4 years ago
- Variations on experimental Go clones of jq☆12Updated 2 years ago