rspeer / wikiparsecLinks
An LL parser for extracting information from Wiki text, particularly Wiktionary.
☆49Updated 2 years ago
Alternatives and similar repositories for wikiparsec
Users that are interested in wikiparsec are comparing it to the libraries listed below
Sorting:
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- A tool for analyzing the word histories of a text.☆35Updated 11 months ago
- Helsinki Finite-State Technology (library and application suite)☆136Updated 3 weeks ago
- A comprehensive graph of mathematical domains and topics☆22Updated 3 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆66Updated last year
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB).☆16Updated last week
- A language evolution simulator, using realistic phonetic changes.☆39Updated 2 years ago
- eXtensible Interlinear Glossed Text☆33Updated 3 years ago
- ☆31Updated 8 years ago
- Pandoc filter to use Wikidata as reference manager☆18Updated 5 years ago
- WordNet-LMF formats☆24Updated 4 months ago
- poetry from dirty ocr☆62Updated 4 years ago
- Random fun with statistical language models.☆63Updated 6 years ago
- linguistics tree drawing to SVG in python, aimed at Jupyter☆65Updated last year
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Wikidata property explorer☆17Updated last year
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Updated 2 months ago
- A command-line tool for interacting with books in git☆111Updated last year
- Java Wiktionary Library☆58Updated 3 years ago
- Automatically exported from code.google.com/p/foma☆124Updated 2 months ago
- A python library to deal with scientific papers.☆17Updated 9 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆38Updated 2 years ago
- Global ASP - African Storybook Project for the World☆16Updated 2 months ago
- English Resource Grammar☆24Updated 3 weeks ago
- Processor scripts for Wikipedia dumps to crush them into a dense binary format that is easy to pathfind with.☆62Updated 8 years ago
- universal syllabification algorithms☆45Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆223Updated 2 years ago
- The curation repository for the data behind Concepticon.☆40Updated 2 weeks ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages.☆35Updated 2 years ago