brianckeegan / Wikipedia
Crawling and analyzing data on Wikipedia
☆16Updated last year
Alternatives and similar repositories for Wikipedia:
Users who are interested in Wikipedia are comparing it to the repositories listed below
- Processing OpenCitations Data☆20Updated 7 years ago
- Python API for KB data-services☆19Updated 5 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Stylometric framework in Python☆17Updated 10 years ago
- A simple Web crawler for stackshare.io using scrapy.☆9Updated 6 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- ☆13Updated 10 years ago
- Tutorial for performing queries on the Wikipedia API for social network analysis☆48Updated 10 years ago
- Cognitive Atlas☆16Updated 8 years ago
- The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man☆20Updated 6 years ago
- Code, data, and paper for Academia.edu citation advantage analysis☆31Updated 9 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 4 years ago
- Pipeline for distributed Natural Language Processing, made in Python☆64Updated 8 years ago
- The OpenCitations RDF Resource Browser☆14Updated 2 weeks ago
- modification of bibliotools 2.2 from Sébastian Grauwin☆11Updated 5 years ago
- Server-side Zotero translation based on Mozilla xpcshell (deprecated)☆38Updated 6 years ago
- ☆48Updated 10 years ago
- Berkeley DLab Python Intensive May 23-26☆28Updated 8 years ago
- Citation Style Language utilities☆18Updated 4 years ago
- Scraper built with Scrapy.☆17Updated 8 months ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆38Updated 11 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- The code that used to power the http://dataverse.org website, distinct from the repository software at https://github.com/IQSS/dataverse☆17Updated 6 years ago
- Text Thresher crowd sourced text annotator☆16Updated 7 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi…☆18Updated last week
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 10 years ago
- Adding links to full text in Wikipedia references☆37Updated last year