dedeler / imdb-data-parser
Parses the IMDB dumps into TSV and Relational Database insert queries
☆60Updated 11 years ago
Alternatives and similar repositories for imdb-data-parser:
Users that are interested in imdb-data-parser are comparing it to the libraries listed below
- Compute association strength over semantic networks in a dimensionality-reduced form.☆33Updated 9 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Python interface to IMDb plain-text data files☆41Updated 7 years ago
- A web application for collecting tweets from the twitter API☆15Updated 9 years ago
- A python tool for collecting tweets in mongoDB using the search API☆80Updated last year
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important.☆55Updated 9 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- ☆41Updated 4 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- Tweet Lake is a commandline interface to Twitter Streaming API and big data project that extracts interesting stats out of tweet corpus.☆20Updated 2 years ago
- ☆89Updated 9 years ago
- Simple python script for storing tweets from the twitter stream directly to a MongoDB database based on a list of terms.☆68Updated 3 years ago
- topic model visualization☆52Updated 9 years ago
- Library for guessing a person's gender by their first name.☆57Updated 7 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Public Machine Learning and Data Competition Repo☆54Updated 9 years ago
- Goal: make Pattern compatible with Python 3.☆59Updated 4 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 8 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- Data Server for Topic Models☆121Updated last year
- Wikipedia Data Analysis Toolkit☆25Updated 8 years ago
- Visualization of text sentiment using deep learning☆44Updated 8 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- QUAC ("quantitative analysis of chatter" or any related acronym you like) is a package for acquiring and analyzing social Internet conten…☆68Updated 4 years ago