Grasia / wiki-scripts
Miscellaneous scripts to gather and process data of wikis.
☆22Updated last year
Alternatives and similar repositories for wiki-scripts:
Users that are interested in wiki-scripts are comparing it to the libraries listed below
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Ensemble topic modeling with matrix factorization☆24Updated 6 years ago
- A multilingual lexicon of words to hurt.☆82Updated 2 months ago
- A TextBlob sentiment analysis pipeline component for spaCy.☆56Updated 3 months ago
- Presentations & notebooks from our talks /workshops/meetups/etc☆24Updated 6 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆22Updated 5 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- A set of utility scripts to process Wikipedia related data☆37Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 11 months ago
- The News Landscape Toolkit (NELA)☆15Updated 4 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Cython wrapper on Hunspell Dictionary☆66Updated 7 months ago
- Experiments to help discussion on Wikipedia talk pages☆66Updated 2 months ago
- Code for learning geographically-informed word embeddings☆22Updated 2 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- Lexicons for the Multilingual UCREL Semantic Analysis System☆40Updated last year
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 5 years ago
- Notebooks and data associated to constructing and exploring a map of subreddits.☆55Updated 7 years ago
- Tutorial for using twarc, with steps for installing software.☆25Updated 6 years ago
- ☆22Updated last year
- TeXoo – A Zoo of Text Extractors☆18Updated 4 years ago
- LNEx: Location Name Extractor☆24Updated 4 years ago
- Harassment Lexicon and Corpus☆29Updated 6 years ago
- Sentiment Analysis and Cognition Engine (text analysis tool)☆18Updated 4 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆15Updated 5 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Updated 3 years ago
- ☆31Updated 9 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago