A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This allows Nutch to rely on Selenium/Firefox to fetch and load javascript/content; while keeping Nutch in charge of what it does best: crawling and further parsing.
☆16Jun 9, 2016Updated 9 years ago
Alternatives and similar repositories for nutch-selenium-grid-plugin
Users that are interested in nutch-selenium-grid-plugin are comparing it to the libraries listed below
Sorting:
- ☆28Jun 9, 2016Updated 9 years ago
- Nutch with Cassandra and Elasticsearch on Docker☆17Oct 26, 2021Updated 4 years ago
- ☆66Dec 11, 2016Updated 9 years ago
- XBlock to use SCORM content in Open edX. Main development in use_ssla_player branch, requires commercial SSLA player by JCA Solutions.☆12Jun 21, 2023Updated 2 years ago
- EOSVR Introduction.☆16Jul 28, 2019Updated 6 years ago
- Assetto Corsa on the steamdeck is working once again, as you've probably noticed none of the online guides are working, so here is one th…☆17Oct 1, 2023Updated 2 years ago
- ☆19Updated this week
- ☆17Sep 14, 2018Updated 7 years ago
- Stanford CoreNLP NER addon for Apache Tika's NamerEntityParser☆13Feb 26, 2022Updated 4 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- A docker image of PhantomJS 2.0 / GhostDriver that's compatible with selenium grid hub☆28Aug 9, 2016Updated 9 years ago
- Vizlinc☆15Jan 14, 2016Updated 10 years ago
- Codemeta paper.☆10Jul 10, 2017Updated 8 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Apr 20, 2015Updated 10 years ago
- A Simple Http to Raw Socket Adapter for Android☆12Aug 30, 2015Updated 10 years ago
- A DropWizard wrapper around Apache Tika.☆10Dec 22, 2016Updated 9 years ago
- OSS2017 - Open Science for Synthesis: Gulf Research Program☆10May 12, 2019Updated 6 years ago
- MEMEX Weapons Pilot for the illegal weapons domain.☆15May 20, 2016Updated 9 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆18Jan 27, 2024Updated 2 years ago
- Image recognition on Spark cluster powered by Deeplearning4j and Apache Tika☆14May 16, 2017Updated 8 years ago
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Apr 28, 2019Updated 6 years ago
- DistributeCrawler的Maven版☆10Jun 20, 2022Updated 3 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess…☆17Sep 11, 2015Updated 10 years ago
- ☆11Jan 16, 2021Updated 5 years ago
- Autoproxy automatically detects proxies and stores them in the respective environment variables (e.g. http_proxy).☆13Oct 2, 2016Updated 9 years ago
- A data management platform for the web☆11Mar 2, 2026Updated 2 weeks ago
- Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum☆18Jul 1, 2022Updated 3 years ago
- Repo for my musical hacks video series☆10Jun 12, 2020Updated 5 years ago
- A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.☆13Sep 8, 2017Updated 8 years ago
- A nerd's boilerplate for your Python project.☆18Oct 15, 2020Updated 5 years ago
- Docker container to provide Apache Tika RESTful API☆41Feb 12, 2016Updated 10 years ago
- Fixes to Sublime Text's JavaScript symbol list☆30Oct 15, 2014Updated 11 years ago
- 一個源自於 LunaTerm 的開源專案,主要是針對在單手使用上的支援,陸續增加新功能☆14Dec 19, 2011Updated 14 years ago
- Measure text similarity using weighted ngrams.☆18Feb 27, 2014Updated 12 years ago
- [DEPRECATED] Use ipfs-provider instead:☆11May 13, 2020Updated 5 years ago
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using the NASA's GEOS-5 Nature Run dataset…☆12Jan 18, 2024Updated 2 years ago
- Aftertouch MIDI Glove controller☆11Dec 30, 2012Updated 13 years ago
- A Phoenix framework pre-launch example application.☆12Nov 16, 2015Updated 10 years ago