garysieling / wikipedia-categorization
☆16 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for wikipedia-categorization
- Stoplists for African languages generated from the ASP corpus ☆14 · Updated 8 years ago
- A browser extension providing Open Access bibliographical services ☆14 · Updated last year
- List of (possible) English hedge words ☆44 · Updated 2 years ago
- Berkeley DLab Python Intensive May 23-26 ☆27 · Updated 8 years ago
- Linked Open Data publication engine ☆6 · Updated 6 years ago
- Merck challenge at Kaggle ☆10 · Updated 10 years ago
- U.S. Code Complexity ☆23 · Updated 11 years ago
- Graph-based framework for text classification ☆24 · Updated 6 years ago
- Bigram/trigram analysis of Wikipedia, mainly mutual information ☆22 · Updated 12 years ago
- Extract Data from Wikipedia Lists ☆30 · Updated 7 years ago
- List of easy American-English words: The New Dale-Chall (1995) ☆32 · Updated 2 years ago
- This repository contains the CEO ontology, the evaluation corpus and the CEO vocabulary. ☆10 · Updated 6 years ago
- A distributed system for mining Common Crawl using SQS, AWS EC2 and S3 ☆14 · Updated 10 years ago
- Different techniques to measure the quality of Wikipedia ☆9 · Updated 7 years ago
- Crawling and analyzing data on Wikipedia ☆16 · Updated 8 months ago
- Alignment, a collaborative, system-aided, user-driven ontology/vocabulary matching and validation platform. ☆12 · Updated 2 years ago
- RESTful API around the PETRARCH coding software ☆10 · Updated 3 years ago
- Tools & scripts to infer new Wikipedia infobox to ontology mappings ☆21 · Updated 8 years ago
- JavaScript library for annotations on files rendered with Box Content Preview ☆10 · Updated 3 months ago
- Basic dataset for the linguistic data collection. ☆15 · Updated 7 years ago
- Scrapes citation statistics from Google Scholar ☆60 · Updated 3 weeks ago
- A repository for web crawling, information extraction, and knowledge graph construction. ☆33 · Updated 6 years ago
- Comparing WARC files ☆15 · Updated 5 years ago
- An online reference for data journalism ☆25 · Updated 10 years ago
- Whit is an open source SMS service that allows you to query CrunchBase, Wikipedia, and several other data APIs. ☆198 · Updated 11 years ago
- *SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach ☆22 · Updated 6 years ago
- List of (possible) English buzzword words ☆56 · Updated 2 years ago
- Metaphor detection using NLP techniques, made in Python using NLTK ☆19 · Updated 10 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi… ☆18 · Updated 3 weeks ago