Grasia / wiki-scripts
Miscellaneous scripts to gather and process data of wikis.
☆22 Updated last year
Alternatives and similar repositories for wiki-scripts:
Users who are interested in wiki-scripts compare it to the libraries listed below.
- Python wrapper for the FrameNet library. ☆24 Updated 13 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. ☆14 Updated 3 years ago
- Python tools for text ☆15 Updated 4 years ago
- Repository of data and code to use the models described in the paper "Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia… ☆10 Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings ☆15 Updated 5 years ago
- Featurize words into orthographic and phonological vectors. ☆40 Updated last year
- A framework to identify relations between ideas in temporal text corpora. ☆28 Updated 6 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆25 Updated 7 years ago
- Compare accuracies of UDPipe models and spaCy models that can be used for NLP annotation ☆14 Updated 7 years ago
- Python package for stylometry ☆61 Updated 3 years ago
- All the notebooks for the analysis of Emotional Arcs within the Project Gutenberg corpus, see "The emotional arcs of stories are dominate… ☆30 Updated 4 months ago
- Regex-like pattern matching on a sentence's tree instead of on strings ☆42 Updated 6 years ago
- Python 3 library for reading and writing WARC files ☆20 Updated 7 years ago
- Wikidata embedding ☆50 Updated 4 months ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- Language Model and Text Classification for German Language using Deep Learning ☆18 Updated 6 years ago
- This is the text partitioner project for Python. ☆21 Updated 6 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆45 Updated 4 years ago
- ☆15 Updated 6 years ago
- A set of utilities for processing MediaWiki XML dump data. ☆50 Updated 2 weeks ago
- Presentations & notebooks from our talks/workshops/meetups/etc. ☆24 Updated 6 years ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆28 Updated 2 years ago
- ☆28 Updated last month
- Language features used in the NELA Toolkit and other news studies ☆13 Updated 4 years ago
- Classify names by gender, U.S. ethnicity, or leaf nationality ☆19 Updated 6 years ago
- The Potsdam Twitter Sentiment Corpus ☆17 Updated 5 years ago
- A Python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago