dedeler / imdb-data-parser
Parses the IMDB dumps into TSV and Relational Database insert queries
☆59Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for imdb-data-parser
- Wikipedia Data Analysis Toolkit☆25Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- Library for guessing a person's gender by their first name.☆57Updated 6 years ago
- A python tool for collecting tweets in mongoDB using the search API☆80Updated last year
- a framework and language for exploring and analyzing feeds of social media data.☆23Updated 12 years ago
- ☆21Updated 3 years ago
- Concept discovery and recommendation library built on top of the IBM Watson cognitive API.☆24Updated 8 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 3 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 3 years ago
- CSV inspection☆10Updated last year
- (BROKEN, help wanted)☆15Updated 8 years ago
- A system for disambiguating toponyms (placenames) given textual context and creating visualizations of the locations referenced in a give…☆19Updated 11 years ago
- Simple python script for storing tweets from the twitter stream directly to a MongoDB database based on a list of terms.☆68Updated 3 years ago
- A web application for collecting tweets from the twitter API☆15Updated 9 years ago
- Python natural language processing work☆29Updated 15 years ago
- modification of bibliotools 2.2 from Sébastian Grauwin☆12Updated 5 years ago
- QUAC ("quantitative analysis of chatter" or any related acronym you like) is a package for acquiring and analyzing social Internet conten…☆68Updated 4 years ago
- a Simple API for RDF☆29Updated 15 years ago
- Highly performant version of open-text-summarizer☆38Updated 10 years ago
- Twitter user classification tutorial at PyCon France 2016☆21Updated last year
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?)☆36Updated 10 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆112Updated 8 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date☆41Updated 3 years ago