frankier / wikiparseLinks

Scrapes some Finnish word definitions from English Wiktionary.

☆8

Alternatives and similar repositories for wikiparse

Users that are interested in wikiparse are comparing it to the libraries listed below

Sorting:

juditacs / wikt2dict
Wiktionary parser tool for many language editions.
☆54Updated 2 years ago
proycon / folia
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…
☆65Updated last year
dbpedia / gfs
DBpedia, which frequently crawls and analyses over 120 Wikipedia language editions has near complete information about (1) which facts ar…
☆11Updated 2 years ago
korpling / salt
A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…
☆15Updated 2 years ago
globalwordnet / schemas
WordNet-LMF formats
☆22Updated last month
yandex / dep_tregex
Stanford Tregex-inspired language for rule-based dependency tree manipulation.
☆21Updated 8 years ago
dkpro / dkpro-uby
Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format
☆22Updated 7 years ago
amir-zeldes / DepEdit
A simple configurable tool for manipulating dependency trees.
☆13Updated 6 months ago
fbkarsdorp / tmi
Flask Interface to Thompson's Motif Index
☆18Updated 6 years ago
korpling / graphANNIS
This is a new backend implementation of the ANNIS linguistic search and visualization system.
☆17Updated last month
jmccrae / lemon-model.net
Source for lemon-model.net
☆11Updated 3 years ago
acoli-repo / olia
Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.
☆20Updated 2 months ago
SemiringInc / Mueller-Report-Corpus
The Mueller Report Corpus V 0.1
☆11Updated 5 years ago
unimorph / wiktionary-tools
Tools for scraping, annotating, and parsing morphological information from Wiktionary
☆15Updated 5 years ago
LuminosoInsight / exquisite-corpus
Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.
☆52Updated 4 years ago
KIZI / LinkedHypernymsDataset
☆14Updated 3 years ago
ufal / treex
Treex NLP framework
☆32Updated 2 weeks ago
cltl / KafNafParserPy
Parser for KAF NAF files written in Python
☆16Updated 4 years ago
mediawiki-utilities / python-mwapi
Simple Python Wrapper around MediaWiki API
☆30Updated 2 years ago
mansayk / fastmorph
Fast corpus search engine originally made for the Corpus of Written Tatar language
☆17Updated 5 years ago
CentreForDigitalHumanities / tei-reader
TEI Reader Python Library
☆17Updated last week
wikimedia / articlequality
Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
☆49Updated 2 weeks ago
componavt / wikokit
Machine-readable Wiktionary
☆76Updated last year
JonathanReeve / macro-etym
A tool for analyzing the word histories of a text.
☆34Updated 7 months ago
proycon / LaMachine
LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…
☆68Updated last year
LanguageMachines / ucto
Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…
☆69Updated 3 weeks ago
rawlins / svgling
linguistics tree drawing to SVG in python, aimed at Jupyter
☆65Updated 10 months ago
korpling / ANNIS
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…
☆75Updated last month
korpling / pepper
A highly extensible plattform for conversion and manipulation of linguistic data between an unbound set of formats. Pepper can be used st…
☆24Updated 6 months ago
annotation / stam
Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…
☆21Updated last month