unicode-org / unilex
Lexical data at Unicode
☆67Updated 4 months ago
Alternatives and similar repositories for unilex:
Users that are interested in unilex are comparing it to the libraries listed below
- Universal Declaration of Human Rights☆11Updated 2 months ago
- The Unicode Cookbook for Linguists☆53Updated 4 years ago
- Collaborative data curation for Glottolog☆154Updated this week
- CLDF: Cross-Linguistic Data Formats - the specification☆56Updated 9 months ago
- A web framework to display Cross Linguistic Linked Data.☆55Updated 2 months ago
- The Data Format for Digital Linguistics (DaFoDiL)☆22Updated last year
- Crawler for linguistic corpora☆197Updated last year
- Various pages and tools for working with non-Latin scripts☆36Updated 2 weeks ago
- The Global WordNet Association Collaborative Inter-Lingual Index☆41Updated 2 months ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated last month
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆89Updated last year
- The World Atlas of Language Structures☆56Updated 3 months ago
- Manage a set of language tag equivalence sets☆14Updated this week
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language☆15Updated 2 weeks ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- A place to find and contribute examples of typographic features in text, especially from non-Latin scripts. Please read the instructions…☆22Updated 3 weeks ago
- Helsinki Finite-State Technology (library and application suite)☆125Updated this week
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- The curation repository for the data behind Concepticon.☆37Updated this week
- PhiloLogic4☆38Updated last month
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆73Updated last month
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- Jainī is a Devanāgarī and Latin typeface based on the calligraphic style of the Jain Kalpasūtra manuscripts.☆20Updated 9 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆32Updated 3 weeks ago
- Python Finite-State Toolkit☆47Updated last week
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆66Updated last month
- SegBo: A database of borrowed sounds in the world’s languages☆16Updated 9 months ago