Apache Nutch fork tunned for web services and data discovery.
β10May 18, 2015Updated 10 years ago
Alternatives and similar repositories for nutch-crawler
Users that are interested in nutch-crawler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β25Apr 6, 2015Updated 10 years ago
- 𧬠Generate secure by default cloud infrastructure configuration with Go and Terraform.β12Jan 23, 2024Updated 2 years ago
- Prion-Like Amino Acid Compositionβ18Dec 15, 2025Updated 3 months ago
- Nutch with Cassandra and Elasticsearch on Dockerβ17Oct 26, 2021Updated 4 years ago
- Harden of the AMS Linux 2β11May 14, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean β’ AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A simple task manager for pesonal use inspired by trello interface.β15Sep 11, 2020Updated 5 years ago
- β28Jun 9, 2016Updated 9 years ago
- A python wrapper to the NASA Common Metadata Repository APIβ20Oct 14, 2021Updated 4 years ago
- PyPop: Python for Population Genomicsβ25Updated this week
- WARNING- This package is no longer supported and will be replaced in the near future. A solution that enables customers to easily create β¦β16Mar 28, 2018Updated 7 years ago
- β23Dec 3, 2020Updated 5 years ago
- An Akka actor that writes JSON data into Amazon Kinesis Firehose.β15Jan 14, 2026Updated 2 months ago
- Teaching data visualization at Columbia University.β10Oct 2, 2015Updated 10 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translationβ15Dec 2, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This is a gem that provides the ability to create a workspace, import scan data from nexpose, and perform a webscan, a web audit, and perβ¦β10Dec 13, 2017Updated 8 years ago
- Examples of how Python can speed up tasks that are cumbersome in Excelβ13Oct 5, 2016Updated 9 years ago
- Hadoop integration code for working with with Apache cTAKESβ10Feb 11, 2014Updated 12 years ago
- A data management system for electronic tags on marine animalsβ13Mar 31, 2025Updated 11 months ago
- The BES framework, which forms the basis for the Hyrax serverβ16Updated this week
- Module 1: Open Principlesβ37Nov 14, 2019Updated 6 years ago
- Xapian full text search plugin for Ruby on Railsβ128Aug 29, 2018Updated 7 years ago
- A couple projects using scikit-learn illustrating project decision making.β15Oct 8, 2016Updated 9 years ago
- β13Apr 11, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.β13Sep 8, 2017Updated 8 years ago
- JAWS is "Just A Web Shell" framework for delivering Force.com web applications to iOS (iPhone/iPad) devices.β14Mar 27, 2011Updated 14 years ago
- Unmaintained templating system used by old versions of Supervisorβ21Nov 15, 2022Updated 3 years ago
- Repository for revision of PREMIS OWL ontology groupβ13May 12, 2022Updated 3 years ago
- The CMR Metadata Review tool is used to curate NASA EOSDIS collection and granule level metadata in CMR for correctness, completeness andβ¦β25Sep 4, 2025Updated 6 months ago
- Packer Serverspec remote provisionerβ32Feb 25, 2023Updated 3 years ago
- Create CovJSON files from common scientific data formatsβ14Apr 24, 2018Updated 7 years ago
- A Rails adapter for test-unitβ11Nov 22, 2025Updated 4 months ago
- ifcParserLib is a set of reusable Java components that implement functionality for IFC file parsing.β10Oct 14, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [DEPRECATED] Use ipfs-provider instead:β11May 13, 2020Updated 5 years ago
- Template scripts for creating new rails applications.β134Nov 2, 2009Updated 16 years ago
- Mirror of Apache Edgent (Incubating) Samplesβ15Feb 14, 2018Updated 8 years ago
- Code for the paper Faster Phrase-Based Decoding by Refining Feature Stateβ14Jan 9, 2023Updated 3 years ago
- RESTful wrapper for the Joshua machine translation decoderβ14Oct 25, 2016Updated 9 years ago
- Gee is a modified version of Oliver Lloyd's JMeter-EC2 project.β16Jan 2, 2014Updated 12 years ago
- Distributed version restore tool for S3β12Jan 5, 2015Updated 11 years ago