rspeer / wikiparsecLinks
An LL parser for extracting information from Wiki text, particularly Wiktionary.
☆49Updated 2 years ago
Alternatives and similar repositories for wikiparsec
Users that are interested in wikiparsec are comparing it to the libraries listed below
Sorting:
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆53Updated 4 years ago
- Wiktionary parser tool for many language editions.☆54Updated 3 years ago
- Random fun with statistical language models.☆64Updated 5 years ago
- ☆31Updated 8 years ago
- A language evolution simulator, using realistic phonetic changes.☆38Updated 2 years ago
- A tool for analyzing the word histories of a text.☆35Updated 9 months ago
- Helsinki Finite-State Technology (library and application suite)☆133Updated 3 months ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB).☆16Updated this week
- Wikidata property explorer☆17Updated last year
- The New Yorken Poesry Magazine is a cultured poetry journal by AI, for AI☆35Updated 6 years ago
- Combine two wikipedia pages to make new facts. Tweets @brand_new_facts☆18Updated 7 years ago
- Grammatical Framework's Resource Grammar Library (RGL)☆57Updated last week
- linguistics tree drawing to SVG in python, aimed at Jupyter☆65Updated last year
- An index of public broadcasts tagged by their primary language.☆53Updated 6 months ago
- Pandoc filter to use Wikidata as reference manager☆17Updated 4 years ago
- Analyse rhyme scheme, metre and form of poems☆132Updated 4 years ago
- Toki Pona Visual Dictionary with English, Italian and Russian translation in pictures☆32Updated 2 years ago
- The curation repository for the data behind Concepticon.☆39Updated last week
- An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, a rich XML-based format for linguistic a…☆17Updated 10 months ago
- eXtensible Interlinear Glossed Text☆33Updated 3 years ago
- universal syllabification algorithms☆45Updated 2 years ago
- poetry from dirty ocr☆62Updated 4 years ago
- A command-line tool for interacting with books in git☆111Updated last year
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆223Updated 2 years ago
- Strips boilerplate from Project Gutenberg text files☆18Updated 4 years ago
- This is the repository for 2018's collaborative NaNoLiPo project.☆34Updated 6 years ago
- Java Wiktionary Library☆58Updated 2 years ago
- A Python module to discover the etymology of words☆149Updated last year
- A comprehensive graph of mathematical domains and topics☆22Updated 3 years ago