Grasia / wiki-scripts
Miscellaneous scripts to gather and process data of wikis.
☆20Updated 2 years ago
Alternatives and similar repositories for wiki-scripts
Users who are interested in wiki-scripts are comparing it to the libraries listed below.
- A thin wrapper around the DBpedia Spotlight HTTP API☆25Updated 7 years ago
- Regex-like pattern tree matching, but on a sentence's tree instead of strings☆42Updated 7 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- Wikidata embedding☆51Updated last year
- ☆32Updated 5 years ago
- A visualisation tool for spaCy using Hierplane.☆65Updated 2 years ago
- A compound word splitter for Python☆49Updated 4 years ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆112Updated 6 months ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- Python package for stylometry☆63Updated 4 years ago
- Essential NLP & ML, short & fast pure Python code☆78Updated 2 months ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆62Updated last year
- Experiments to help discussion on Wikipedia talk pages☆68Updated this week
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 5 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- ☆59Updated 10 years ago
- Code for learning geographically-informed word embeddings☆22Updated 3 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Doing things with embeddings☆66Updated 3 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all Indian languages and English. Provides intra-indic string compari…☆58Updated 6 years ago
- ☆70Updated 3 years ago
- Negation detection NLP tool. If you use the code, please cite George Gkotsis, Sumithra Velupillai, Anika Oellrich, Harry Dean,…☆54Updated 8 years ago
- Harassment Lexicon and Corpus☆30Updated 7 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)☆116Updated last year
- ☆105Updated 7 years ago
- A collection of over 1.5 million tweets translated to French, with their sentiment.☆35Updated 8 years ago
- Compare accuracies of UDPipe models and spaCy models which can be used for NLP annotation☆14Updated 7 years ago
- Presentations & notebooks from our talks/workshops/meetups/etc.☆24Updated 7 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Language-agnostic political event coding using universal dependencies☆18Updated 6 years ago