zseder / hundictLinks
bilingual dictionary extractor from parallel corpora
☆22Updated 11 years ago
Alternatives and similar repositories for hundict
Users that are interested in hundict are comparing it to the libraries listed below
Sorting:
- UniParse: A universal graph-based parsing toolkit☆10Updated 5 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- Corpus preprocessing☆97Updated last year
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 5 months ago
- The zhong [|] Chinese grammars☆14Updated last month
- LSTM Language Model with Subword Units Input Representations☆42Updated 4 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆73Updated 10 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆77Updated 2 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated 2 years ago
- An Interactive Tool for Annotating Discourse Structure and Text Improvement☆16Updated 3 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 7 years ago
- A tool for text normalisation via character-level machine translation☆13Updated 5 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆44Updated 8 months ago
- Concept dictionary☆38Updated last year
- Thot toolkit for statistical machine translation☆53Updated 2 years ago
- Workshop on Noisy User-generated Text (W-NUT)☆30Updated 2 months ago
- ☆43Updated 10 years ago
- ☆23Updated 8 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 10 months ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆14Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ☆74Updated 3 months ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆88Updated 7 years ago