rspeer / wikiparsec
An LL parser for extracting information from Wiki text, particularly Wiktionary.
☆48 Updated last year
Alternatives and similar repositories for wikiparsec:
Users who are interested in wikiparsec are comparing it to the libraries listed below.
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆51 Updated 3 years ago
- Wiktionary parser tool for many language editions. ☆54 Updated 2 years ago
- Pandoc filter to use Wikidata as reference manager ☆17 Updated 4 years ago
- The curation repository for the data behind Concepticon. ☆37 Updated this week
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB). ☆16 Updated this week
- Tools for the 3rd edition of the Constraint Grammar formalism. ☆21 Updated this week
- eXtensible Interlinear Glossed Text ☆32 Updated 2 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary ☆13 Updated 5 years ago
- A tool for analyzing the word histories of a text. ☆34 Updated 2 months ago
- Sort-friendly URI Reordering Transform (SURT) python module ☆41 Updated 6 months ago
- Command-line corpus tools ☆9 Updated 7 years ago
- A comprehensive graph of mathematical domains and topics ☆20 Updated 3 years ago
- Random fun with statistical language models. ☆65 Updated 5 years ago
- Perpetual Access To The Scholarly Record ☆118 Updated 5 months ago
- Combine two wikipedia pages to make new facts. Tweets @brand_new_facts ☆18 Updated 6 years ago
- ☆29 Updated 7 years ago
- Basic dataset for the linguistic data collection. ☆15 Updated 7 years ago
- English Resource Grammar ☆20 Updated 5 months ago
- Wikidata property explorer ☆16 Updated 11 months ago
- A language evolution simulator, using realistic phonetic changes. ☆38 Updated last year
- TEI Reader Python Library ☆17 Updated last year
- MG top-down beam parsing ☆13 Updated 6 years ago
- This repository contains code behind the visualization of the Wikimedia tool etytree at http://tools.wmflabs.org/etytree/ ☆50 Updated 5 years ago
- The Unicode Cookbook for Linguists ☆53 Updated 4 years ago
- A set of tools for analysis of texts in the Ithkuil constructed language ☆31 Updated 5 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system. ☆17 Updated this week
- Lexical data at Unicode ☆67 Updated 4 months ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk ☆30 Updated last year
- universal syllabification algorithms ☆44 Updated 2 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆37 Updated last year