ContinuumIO / scrapy_scrapers
Scraper built with Scrapy.
☆17Updated 8 months ago
Alternatives and similar repositories for scrapy_scrapers:
Users that are interested in scrapy_scrapers are comparing it to the libraries listed below
- An online reference for data journalism☆25Updated 11 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- ☆21Updated 9 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆16Updated 9 years ago
- Ask questions about government data.☆37Updated 6 years ago
- ☆13Updated 9 years ago
- Topic modeling web application☆40Updated 9 years ago
- a Simple API for RDF☆29Updated 15 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- A glossary for the United States.☆42Updated 10 years ago
- Browser add-on and web server to support collection and analysis of web browsing data.☆13Updated 9 years ago
- General Architecture for Text Engineering☆49Updated 9 years ago
- The OpenSextant Gazetteer is a collection of world-wide place name data☆12Updated 7 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi…☆14Updated 11 years ago
- ☆25Updated 9 years ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs.☆198Updated 11 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- A pastebin for tables.☆34Updated 11 years ago
- vIPer: a new tool for IPython notebooks.☆60Updated 10 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆24Updated 8 years ago
- Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
- Open Knowledge coding standards and style guide.☆35Updated 5 years ago
- JSON schemas for OpenCorporates data☆20Updated this week
- Python library and command line tool for converting data from one format to another☆99Updated 4 years ago
- The User Activity Logging Engine, or User-ALE, is a logging mechanism used to quantitatively assess the behavioural and cognitive state o…☆13Updated 8 years ago
- Hadoop MapReduce over Hive based implementation of attributed network pattern matching.☆40Updated 10 years ago
- ☆22Updated 13 years ago
- ☆13Updated 10 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- Scan a folder of document files of all types and extract the text into a CSV suitable for Overview☆26Updated 9 years ago