frankier / wikiparseLinks
Scrapes some Finnish word definitions from English Wiktionary.
☆8Updated last year
Alternatives and similar repositories for wikiparse
Users that are interested in wikiparse are comparing it to the libraries listed below
Sorting:
- Wiktionary parser tool for many language editions.☆54Updated 2 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆65Updated last year
- DBpedia, which frequently crawls and analyses over 120 Wikipedia language editions has near complete information about (1) which facts ar…☆11Updated 2 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated 2 years ago
- WordNet-LMF formats☆22Updated last month
- Stanford Tregex-inspired language for rule-based dependency tree manipulation.☆21Updated 8 years ago
- Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format☆22Updated 7 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 6 months ago
- Flask Interface to Thompson's Motif Index☆18Updated 6 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆17Updated last month
- Source for lemon-model.net☆11Updated 3 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆20Updated 2 months ago
- The Mueller Report Corpus V 0.1☆11Updated 5 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 4 years ago
- ☆14Updated 3 years ago
- Treex NLP framework☆32Updated 2 weeks ago
- Parser for KAF NAF files written in Python☆16Updated 4 years ago
- Simple Python Wrapper around MediaWiki API☆30Updated 2 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Updated 5 years ago
- TEI Reader Python Library☆17Updated last week
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆49Updated 2 weeks ago
- Machine-readable Wiktionary☆76Updated last year
- A tool for analyzing the word histories of a text.☆34Updated 7 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆69Updated 3 weeks ago
- linguistics tree drawing to SVG in python, aimed at Jupyter☆65Updated 10 months ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated last month
- A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…☆24Updated 6 months ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆21Updated last month