ContinuumIO / scrapy_scrapers
Scraper built with Scrapy.
☆14 Updated 2 months ago
Related projects
Alternatives and complementary repositories for scrapy_scrapers
- ☆21 Updated 8 years ago
- An online reference for data journalism ☆25 Updated 10 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆23 Updated 8 years ago
- APIs that access United States Federal Government content and metadata on govinfo. ☆10 Updated 6 years ago
- Charts for the Consumer Financial Protection Bureau ☆12 Updated 7 months ago
- Ask questions about government data. ☆37 Updated 5 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi… ☆14 Updated 10 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 Updated 9 years ago
- Topic modeling web application ☆39 Updated 9 years ago
- A series of analytics for creating networks from geo-temporal track data based on time/space co-occurrence. Includes UI for visualizatio… ☆14 Updated 6 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 Updated 6 years ago
- Python and pandas tools to perform various analyses on different types of word lists ☆16 Updated 9 years ago
- Automated NLP sentiment predictions - batteries included, or use your own data ☆18 Updated 6 years ago
- c-span opened captions node buffer server + google docs apps script ☆8 Updated 5 years ago
- A glossary for the United States. ☆42 Updated 9 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… ☆14 Updated 9 years ago
- Demo of the Newspaper article extraction library. ☆29 Updated 9 years ago
- R-implementation of a Markov-Modulated Poisson Process for unsupervised event detection. ☆14 Updated 8 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching. ☆40 Updated 10 years ago
- framework for making streamcorpus data ☆11 Updated 7 years ago
- ☆13 Updated 8 years ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs. ☆198 Updated 11 years ago
- Code and templates required to build the DARPA open catalog. ☆17 Updated 8 years ago
- SmallK: very fast data clustering tools ☆14 Updated 5 years ago
- Google Refine extension for adding columns (extending data) from DBpedia ☆39 Updated 11 years ago