juditacs / wikt2dict
Wiktionary parser tool for many language editions.
☆54 Updated 3 years ago
Alternatives and similar repositories for wikt2dict
Users who are interested in wikt2dict are comparing it to the libraries listed below.
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆35 Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆15 Updated 6 years ago
- A multilingual parallel corpus created from translations of the Bible. ☆191 Updated 7 months ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation ☆198 Updated 5 years ago
- bilingual dictionary extractor from parallel corpora ☆23 Updated 11 years ago
- Morphological Dictionaries for German Language ☆30 Updated 7 years ago
- German Morphological Analyzer ☆51 Updated 4 years ago
- Python Finite-State Toolkit ☆60 Updated last month
- The Open Multilingual Wordnet ☆66 Updated last year
- Machine-readable Wiktionary ☆77 Updated last year
- Sentence aligner ☆122 Updated 4 years ago
- ☆23 Updated 8 years ago
- The curation repository for the data behind Concepticon. ☆42 Updated last week
- A cloud-based, open-source system for writing and publishing dictionaries. ☆98 Updated last year
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language ☆16 Updated this week
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆19 Updated last week
- Collaborative on-line editor for aligned parallel texts. ☆13 Updated last month
- Python framework for processing Universal Dependencies data ☆58 Updated this week
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆70 Updated 2 weeks ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆13 Updated 2 years ago
- Various utilities for processing the data. ☆215 Updated last week
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆57 Updated last month
- Crawler for linguistic corpora ☆213 Updated 4 months ago
- Linguistic search for large annotated text corpora, based on Apache Lucene ☆117 Updated this week
- Java Wiktionary Library ☆58 Updated 3 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format ☆33 Updated 6 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with… ☆75 Updated 2 months ago
- Transform TMX to text ☆28 Updated 3 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆17 Updated 5 years ago
- CONLL-U to Pandas DataFrame ☆31 Updated 8 years ago