djstrong / PL-Wiktionary-To-Dictionary
Parses the Polish Wiktionary and creates simple dictionaries from foreign languages (e.g. English) to Polish and vice versa.
☆16 · Updated 11 years ago
Alternatives and similar repositories for PL-Wiktionary-To-Dictionary:
Users who are interested in PL-Wiktionary-To-Dictionary compare it to the repositories listed below.
- The source of the phonetic transcriptions is Oxford Advanced Learner's Dictionary (3rd ed.), available from the Oxford Text Archive (http… ☆23 · Updated 7 years ago
- Modernized version of Eric Brill's Part Of Speech tagger. ☆17 · Updated last year
- Supervised learning of morphology ☆28 · Updated 8 years ago
- A parser and autocorrection tool for wiktionary. ☆39 · Updated 9 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling… ☆15 · Updated 2 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆11 · Updated last year
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi… ☆18 · Updated 2 months ago
- ACE View is a natural language based ontology and rule editor. ACE View uses Attempto Controlled English (ACE) in the front-end, and Web … ☆8 · Updated 6 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format) ☆65 · Updated 8 years ago
- Natural Language Q/A app using DRT. ☆34 · Updated 13 years ago
- phonetic transcription for Tibetan ☆10 · Updated 6 years ago
- Recipes for training OpenNMT systems ☆14 · Updated 7 years ago
- Python Unicode Block Utilities ☆24 · Updated 4 years ago
- Command-line corpus tools ☆9 · Updated 7 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆63 · Updated 10 months ago
- Easy language identification of 380 languages ☆17 · Updated 5 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆20 · Updated 4 months ago
- U.S. Code Complexity ☆23 · Updated 11 years ago
- ☆31 · Updated 3 years ago
- Parser for KAF NAF files written in Python ☆16 · Updated 3 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language ☆16 · Updated 5 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆12 · Updated last year
- Scraper for TED Talks in Python. Get talk title, transcript, talk topics and so on. ☆15 · Updated 7 years ago
- Learning Based Java (LBJava) ☆13 · Updated 2 years ago
- Stanford Tregex-inspired language for rule-based dependency tree manipulation. ☆21 · Updated 8 years ago
- Offline bilingual dictionaries made using data from Wiktionary ☆53 · Updated 9 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 · Updated 2 years ago
- common language and mathematics processing algorithms, in Rust ☆26 · Updated last year
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR). ☆23 · Updated 10 years ago
- OCR for DjVu ☆48 · Updated 2 years ago