rspeer / wikiparsec
An LL parser for extracting information from Wiki text, particularly Wiktionary.
☆49Updated last year
Alternatives and similar repositories for wikiparsec:
Users that are interested in wikiparsec are comparing it to the libraries listed below
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- The curation repository for the data behind Concepticon.☆38Updated 2 months ago
- eXtensible Interlinear Glossed Text☆32Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated 11 months ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/☆51Updated 5 years ago
- Command-line corpus tools☆9Updated 7 years ago
- ☆30Updated 8 years ago
- A language evolution simulator, using realistic phonetic changes.☆38Updated 2 years ago
- Insert matching punctuation for mismatched quotation marks, parentheses, etc. Good postprocessing for N-gram text synthesis.☆15Updated 9 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆30Updated last year
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆34Updated last year
- Pandoc filter to use Wikidata as reference manager☆17Updated 4 years ago
- Supervised learning of morphology☆28Updated 8 years ago
- Tools for the 3rd edition of the Constraint Grammar formalism.☆22Updated 2 weeks ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Simple CORPORA list crawler☆10Updated 8 years ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB).☆16Updated last week
- The Open Multilingual Wordnet☆61Updated 11 months ago
- Building and Using A Seed Corpus for the Human Language Project☆11Updated 7 years ago
- English Resource Grammar☆21Updated 8 months ago
- TEI Reader Python Library☆17Updated last year
- Automatically exported from code.google.com/p/hunpos☆12Updated 7 years ago
- Deutsch Language Tool Kit☆12Updated 9 years ago
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆18Updated 5 months ago
- A web framework to display Cross Linguistic Linked Data.☆56Updated 2 months ago
- Grammatical Framework's Resource Grammar Library (RGL)☆56Updated 3 weeks ago
- Random fun with statistical language models.☆65Updated 5 years ago
- A comprehensive graph of mathematical domains and topics☆21Updated 3 years ago