ybourque / Wikparser
Wiktionary Parser
☆28 Updated 8 years ago
Alternatives and similar repositories for Wikparser:
Users who are interested in Wikparser are comparing it to the libraries listed below:
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆68 Updated 2 months ago
- A parser and autocorrection tool for wiktionary. ☆39 Updated 9 years ago
- Wiktionary parser tool for many language editions. ☆54 Updated 2 years ago
- Hierarchical phrase-based machine translation system ☆32 Updated 10 years ago
- Fast Word Clustering Software ☆78 Updated 2 months ago
- Multilingual Language Modeling Toolkit ☆11 Updated 7 years ago
- A fast, simple, multilingual tokenizer ☆29 Updated 7 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- Command-line corpus tools ☆9 Updated 7 years ago
- Morpha lex stemmer converted using jflex. ☆23 Updated 4 years ago
- The Community-enRiched Open WordNet (CROWN) ☆18 Updated 9 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 Updated 3 years ago
- WordNet Domains, WordNet Affect and SentiWords ☆48 Updated 9 years ago
- Grammar rules and dictionaries for the phonetic transcription of Russian sentences ☆33 Updated 3 years ago
- Fast approximate strings search & spelling correction ☆58 Updated 3 years ago
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆23 Updated 8 years ago
- A tool for calculating semantic similarity between words from a text corpus based on lexico-syntactic patterns. ☆27 Updated 9 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 Updated 8 years ago
- Thot toolkit for statistical machine translation ☆53 Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 Updated 2 years ago
- Code for morphological transformations ☆29 Updated 7 years ago
- PurePos is an open source hybrid morphological tagger. ☆16 Updated 4 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 Updated last year
- Open-source tools for morphological tagging, segmentation and stemming. ☆40 Updated 5 years ago
- The SRL-based Open IE extractor. A principal component of Open IE 4.0. ☆19 Updated 7 years ago
- Stanford CoreNLP annotator implementing jMWE for detecting Multi-Word Expressions / collocations ☆15 Updated 8 years ago
- NameTag: Named Entity Tagger ☆38 Updated 7 months ago
- morphologically informed POS tagging for German ☆25 Updated 3 years ago
- Basic dataset for the linguistic data collection. ☆15 Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 Updated 2 years ago