scrapinghub / scrapinghub-stack-portia
Software stack used to run Portia spiders in Scrapinghub cloud
☆11Updated 5 years ago
Alternatives and similar repositories for scrapinghub-stack-portia
Users that are interested in scrapinghub-stack-portia are comparing it to the libraries listed below
Sorting:
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 8 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- iCQA - Intelligent Community Question Answering Framework☆31Updated 8 years ago
- Generate westminster parliament charts as virtual-dom SVG.☆12Updated 3 years ago
- DataFlow GUI is a desktop application for constructing Big Data programs through building DAG☆12Updated 7 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- Recommendations Serving Engine using python☆28Updated 9 years ago
- Cuts movie dialog summary video.☆10Updated 9 years ago
- REST API server with built in auth, interface to ScyllaDB/Cassandra☆24Updated 7 years ago
- MIT Big Data Challenge☆14Updated 11 years ago
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Updated 12 years ago
- GHRecommender - personalized recommendations for GitHub projects based on information about repositories starred by the user☆26Updated 2 years ago
- ☆16Updated 7 years ago
- ZoomCharts JavaScript Charts library☆8Updated last week
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Exploration Library in Java☆12Updated last year
- A graph knowledge base implemented in neo4j.☆12Updated 6 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 3 years ago
- Code for the GPU mega-benchmark article☆14Updated 7 years ago
- Dump mysql tables to s3, and parse them☆31Updated 10 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- Data cleaning made easy☆8Updated 7 years ago
- IoTQL - an SQL-like language for the IoT☆16Updated 8 years ago
- Distributed text analysis suite based on Celery☆95Updated 2 years ago
- Repository for SF QConf 2015 Workshop☆16Updated 6 months ago
- Deprecated Git repository. Please move to☆24Updated 3 years ago
- The main Catalyst repository with the latest versions of all projects. The current Moderator is Patrick Wagstorm☆13Updated 8 years ago
- Deep learning certificate part 1☆10Updated 3 years ago
- Real-time chat and group chat for PEPS☆7Updated 9 years ago
- Web Data Extraction from Flat and Nested Records☆9Updated 9 years ago