hardikvasa / wikipedia-crawlerLinks
This is a program to crawl entire 'Wikipedia' and extract & store information from the pages as required.
☆75Updated last year
Alternatives and similar repositories for wikipedia-crawler
Users that are interested in wikipedia-crawler are comparing it to the libraries listed below
Sorting:
- Fake news detection, Google Summer of Code 2017☆91Updated 7 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆260Updated 9 years ago
- OpenTC is a text classification engine using several algorithms in machine learning☆27Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- An introduction to using spaCy for NLP and machine learning☆192Updated 3 years ago
- ☆40Updated 9 years ago
- Python wrapper for Stanford CoreNLP tools☆58Updated 9 years ago
- A baseline implementation for FNC-1☆138Updated 3 years ago
- Uses NLP and wikipedia to try to generate trivia questions☆132Updated 8 years ago
- Using Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").☆25Updated 9 years ago
- A library & tools to evaluate predictive language models.☆63Updated 2 years ago
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 6 years ago
- The goal of this project is to implement a Question Answering (QA) system that answers causal type questions. We use Wikipedia as a knowl…☆102Updated 12 years ago
- This repo contains the code and documentation for automatic essay evaluation and short answer evaluation.☆24Updated 8 years ago
- Download scripts for distributing twitter data.☆62Updated 2 years ago
- Natural Language Processing☆95Updated 8 years ago
- A Python based web crawler that crawls all the web pages in a breathe-first approach from the given seed page☆14Updated 10 years ago
- Simple practice for text classification using Python☆58Updated 10 years ago
- Python scripts for building 'Short Jokes' dataset, featured on Kaggle☆276Updated 4 years ago
- A pythonic wrapper for Stanford CoreNLP.☆108Updated 7 years ago
- Collection of functions and scripts for text retrieval in Python: Document collection preprocessing, Feature Selection, Indexing, Query p…☆43Updated 12 years ago
- http://cs224n.stanford.edu☆61Updated 9 years ago
- Datasets for Deep learning Personas☆62Updated 7 years ago
- A very brief introduction to Natural Language Processing programming in Python☆151Updated last year
- HackDelft☆81Updated 8 years ago
- Community Curated NLP List☆198Updated 3 years ago
- Python code for detecting topics/events from a Twitter stream☆100Updated 7 years ago
- Working with sentiment analysis in Python.☆213Updated 10 years ago
- Twitter hashtag prediction☆282Updated 8 years ago
- Deep Learning models to detect hate speech in tweets☆217Updated 7 years ago