rspeer / wikiparsec
An LL parser for extracting information from Wiki text, particularly Wiktionary.
☆48Updated last year
Related projects ⓘ
Alternatives and complementary repositories for wikiparsec
- Wiktionary parser tool for many language editions.☆53Updated 2 years ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB).☆16Updated this week
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆61Updated 6 months ago
- eXtensible Interlinear Glossed Text☆31Updated 2 years ago
- A tool for analyzing the word histories of a text.☆34Updated 3 months ago
- The curation repository for the data behind Concepticon.☆34Updated this week
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 8 months ago
- Tools for the 3rd edition of the Constraint Grammar formalism.☆21Updated 2 months ago
- Pandoc filter to use Wikidata as reference manager☆17Updated 4 years ago
- Command-line corpus tools☆9Updated 7 years ago
- The Open Multilingual Wordnet☆58Updated 6 months ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Updated this week
- TEI Reader Python Library☆16Updated 11 months ago
- Simple CORPORA list crawler☆10Updated 7 years ago
- Random fun with statistical language models.☆65Updated 5 years ago
- Basic dataset for the linguistic data collection.☆15Updated 7 years ago
- Automatically exported from code.google.com/p/hunpos☆11Updated 6 years ago
- Python framework for processing Universal Dependencies data☆57Updated this week
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- Helsinki Finite-State Technology (library and application suite)☆123Updated this week
- ☆27Updated 7 years ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆50Updated 5 years ago
- CLI tool for importing entities from Wikidata / Wikibase☆23Updated 2 years ago
- Sort-friendly URI Reordering Transform (SURT) python module☆40Updated 3 months ago
- A comprehensive graph of mathematical domains and topics☆20Updated 2 years ago
- a python package for cleaning Gutenberg books and dataset☆32Updated last year
- bilingual dictionary extractor from parallel corpora☆22Updated 10 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆86Updated 10 months ago