ContinuumIO / scrapy_scrapers
Scraper built with Scrapy.
☆18Updated 10 months ago
Alternatives and similar repositories for scrapy_scrapers
Users who are interested in scrapy_scrapers are comparing it to the libraries listed below.
- An online reference for data journalism☆25Updated 11 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆24Updated 8 years ago
- A glossary for the United States.☆42Updated 10 years ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 9 years ago
- Importer for US Spending data☆33Updated 10 years ago
- JSON schemas for OpenCorporates data☆20Updated last month
- Canadian legislative scrapers☆32Updated this week
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Open Knowledge coding standards and style guide.☆35Updated 5 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs.☆198Updated 12 years ago
- ☆13Updated 9 years ago
- ☆22Updated 13 years ago
- The Python port of sucka.☆20Updated 10 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data☆12Updated 7 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- A contextual news development environment.☆49Updated 10 years ago
- ☆10Updated 9 years ago
- Charts for the Consumer Financial Protection Bureau☆12Updated last year
- Topic modeling web application☆41Updated 9 years ago
- Ready or Not...☆50Updated 7 years ago
- Automated NLP sentiment predictions: batteries included, or use your own data☆18Updated 7 years ago
- Extract a list of results from search engine pages as CSV with a bookmarklet, directly within the browser☆24Updated 3 months ago
- An organization chart for the government of the United States.☆38Updated 11 years ago
- ☆25Updated 9 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago