unicode-org / unilex
Lexical data at Unicode
☆67 · Updated 5 months ago
Alternatives and similar repositories for unilex:
Users who are interested in unilex are comparing it to the repositories listed below.
- Universal Declaration of Human Rights ☆12 · Updated 3 months ago
- The Global WordNet Association Collaborative Inter-Lingual Index ☆41 · Updated 3 months ago
- The Unicode Cookbook for Linguists ☆53 · Updated 4 years ago
- Various pages and tools for working with non-Latin scripts ☆36 · Updated this week
- A place to find and contribute examples of typographic features in text, especially from non-Latin scripts. Please read the instructions… ☆22 · Updated 2 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases ☆33 · Updated 2 months ago
- Jainī is a Devanāgarī and Latin typeface based on the calligraphic style of the Jain Kalpasūtra manuscripts. ☆21 · Updated 10 months ago
- German part-of-speech dictionary ☆43 · Updated last year
- The Data Format for Digital Linguistics (DaFoDiL) ☆22 · Updated 2 years ago
- Crawler for linguistic corpora ☆199 · Updated last year
- Wiktionary parser tool for many language editions. ☆54 · Updated 2 years ago
- A web framework to display Cross Linguistic Linked Data. ☆55 · Updated this week
- Manage a set of language tag equivalence sets ☆14 · Updated this week
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆20 · Updated 2 months ago
- Python Finite-State Toolkit ☆50 · Updated last month
- font development, testing and release ☆14 · Updated 2 weeks ago
- PhiloLogic4 ☆38 · Updated 2 months ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project ☆46 · Updated 3 months ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆13 · Updated 5 years ago
- CLDF: Cross-Linguistic Data Formats - the specification ☆57 · Updated 10 months ago
- A HarfBuzz Python binding ☆71 · Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 · Updated 3 years ago
- Helsinki Finite-State Technology (library and application suite) ☆128 · Updated 3 weeks ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies) ☆27 · Updated 3 years ago
- The curation repository for the data behind Concepticon. ☆37 · Updated this week
- A PDF library extracted from TeX's dvipdfmx ☆25 · Updated 5 months ago
- PHOIBLE Online ☆42 · Updated 2 years ago
- unicodedata backport/updates ☆36 · Updated last month
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 · Updated last year
- Find languages that use a given non-ASCII character, or find characters used by a particular language. ☆17 · Updated 2 months ago