socialsensor / storm-focused-crawlerLinks
Collects multimedia content shared through social networks.
☆19Updated 10 years ago
Alternatives and similar repositories for storm-focused-crawler
Users that are interested in storm-focused-crawler are comparing it to the libraries listed below
Sorting:
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 8 years ago
- Client side extractive text summarization using JS, based on TextRank. Since there's no server trip involved, one can can safely use it f…☆16Updated 11 years ago
- The first Open Source document analysis platform☆65Updated 4 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…☆33Updated 3 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- Vizlinc☆15Updated 9 years ago
- ☆14Updated 8 years ago
- ☆55Updated 5 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆82Updated 12 years ago
- ☆13Updated 9 years ago
- Sentiment analysis framework developed by CERTH.☆22Updated 10 years ago
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base.☆31Updated 12 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch☆49Updated 10 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 12 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Updated 11 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- fuzzydb is a fuzzy matching database engine capable of providing human-like search results that make life much easier for users of websit…☆20Updated 2 years ago
- Simple search results with Solr and EmberJS☆58Updated 6 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- Blog crawler for the blogforever project.☆23Updated 11 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- A crawler for various popular tech news sources. Read technology news from the comfort of your CLI.☆56Updated 12 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 9 years ago
- Load a linkedin network w/ python py2neo into a neo4j database, serve it via node.js, and display it w/ sigma.js☆29Updated 12 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Updated 8 years ago
- Node wrapper for Ark-TweetNLP.☆16Updated 9 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 6 months ago