dedeler / imdb-data-parser
Parses the IMDB dumps into TSV and Relational Database insert queries
☆60 Updated 11 years ago
Alternatives and similar repositories for imdb-data-parser:
Users who are interested in imdb-data-parser compare it to the libraries listed below:
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- Python and pandas tools to perform various analyses on different types of word lists ☆16 Updated 10 years ago
- A python tool for collecting tweets in mongoDB using the search API ☆80 Updated last year
- Python interface to IMDb plain-text data files ☆41 Updated 7 years ago
- An interactive map of reddit: the "front page of the internet" ☆38 Updated 9 years ago
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly ☆47 Updated 4 years ago
- mltk - Moz Language Tool Kit ☆12 Updated 10 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form. ☆32 Updated 9 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news" ☆61 Updated 3 years ago
- (BROKEN, help wanted) ☆15 Updated 9 years ago
- rapid nlp prototyping ☆71 Updated 2 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 Updated 9 years ago
- Visualization of text sentiment using deep learning ☆43 Updated 8 years ago
- Plots various graphs for a series of plaintext files in a directory ☆19 Updated 8 years ago
- A web application for collecting tweets from the twitter API ☆15 Updated 10 years ago
- Predict age and gender from a first name ☆60 Updated 6 years ago
- Simple python script for storing tweets from the twitter stream directly to a MongoDB database based on a list of terms. ☆68 Updated 3 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy. ☆41 Updated 5 years ago
- Demo code for learning_text_transformer ☆25 Updated 10 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3. ☆57 Updated 9 years ago
- linkedin_to_neo4j ☆24 Updated 9 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 Updated 8 years ago
- Reduction is a python script which automatically summarizes a text by extracting the sentences which are deemed to be most important. ☆55 Updated 10 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 Updated 3 years ago
- ☆34 Updated 8 years ago
- Semanticizest: dump parser and client ☆20 Updated 8 years ago
- Hacker News: Crunching the Numbers ☆71 Updated 9 years ago
- CSV inspection ☆10 Updated 2 years ago
- Tool for computing continuous distributed representations of words. Modified to learn N-Grams ☆15 Updated 8 years ago