Grasia / wiki-scriptsLinks
Miscellaneous scripts to gather and process data of wikis.
☆21Updated 2 years ago
Alternatives and similar repositories for wiki-scripts
Users that are interested in wiki-scripts are comparing it to the libraries listed below
Sorting:
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆61Updated last year
- Code for learning geographically-informed word embeddings☆22Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Updated 7 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API☆25Updated 7 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 7 years ago
- Cite: http://www.aclweb.org/anthology/W/W17/W17-08.pdf#page=103☆8Updated 8 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Repository of data and code to use the models described in the paper "Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia…☆10Updated 2 years ago
- A set of utility scripts to process Wikipedia related data☆38Updated 2 years ago
- Python library providing sentiment lexicons.☆26Updated 8 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Finds linguistic patterns effortlessly☆36Updated last year
- This is the text partitioner project for Python.☆21Updated 6 years ago
- Wikidata embedding☆50Updated 7 months ago
- Raw Wikipedia counts for entity linking☆19Updated 8 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Ensemble topic modeling with matrix factorization☆25Updated 7 years ago
- Python tools for text☆15Updated 5 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- A compound word splitter for Python☆48Updated 3 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated 11 months ago