socialsensor / storm-focused-crawlerLinks
Collects multimedia content shared through social networks.
☆19Updated 10 years ago
Alternatives and similar repositories for storm-focused-crawler
Users that are interested in storm-focused-crawler are comparing it to the libraries listed below
Sorting:
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 9 years ago
- Client side extractive text summarization using JS, based on TextRank. Since there's no server trip involved, one can can safely use it f…☆16Updated 11 years ago
- The first Open Source document analysis platform☆65Updated 4 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 3 years ago
- A crawler for various popular tech news sources. Read technology news from the comfort of your CLI.☆56Updated 12 years ago
- fuzzydb is a fuzzy matching database engine capable of providing human-like search results that make life much easier for users of websit…☆20Updated 2 years ago
- ☆13Updated 10 years ago
- Sentiment analysis framework developed by CERTH.☆22Updated 10 years ago
- Vizlinc☆15Updated 9 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch☆49Updated 10 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- Simple search results with Solr and EmberJS☆58Updated 6 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- KnowledgeStore☆21Updated 7 years ago
- Generates visualizations of influential tweets about a given hashtag.☆11Updated 8 years ago
- ☆55Updated 5 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 13 years ago
- ☆20Updated 8 years ago
- Deprecated Module: See Xponents or OpenSextantToolbox as active code base.☆30Updated 12 years ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 10 months ago
- Writer Identification of Handwritten Documents☆13Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 9 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis…☆33Updated 4 years ago
- Data science tools from Moz☆23Updated 8 years ago
- System for mining Wikipedia Usage data to read our collective mind☆20Updated 11 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- Raw Wikipedia counts for entity linking☆19Updated 8 years ago
- bigram / trigram analysis of wikipedia; mainly mutual info☆22Updated 13 years ago
- ☆14Updated 8 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Updated 8 years ago