forcedotcom / SiteCrawlerLinks
This is a Java library which can be used to crawl the content of some of web properties (www.salesforce.com, blogs.salesforce.com for example). It supports dynamic scaling (depending on available machine power (CPU, RAM) and network capacity) out of the box. It also has a Plugin structure, which allows others to write code (plugins) that act on …
☆23Updated last month
Alternatives and similar repositories for SiteCrawler
Users that are interested in SiteCrawler are comparing it to the libraries listed below
Sorting:
- Simplified scalable aggregation and processing framework built upon Apache Camel.☆22Updated 6 years ago
- Mirror of Apache Cocoon☆28Updated last month
- Implementation of the SOA Repository Artifact Model and Protocol (S-RAMP)☆18Updated 8 years ago
- Web-based management process and task management console☆54Updated 9 months ago
- Apache Commons JXPath☆34Updated this week
- Data abstraction, storage, discovery, and serving system☆32Updated 3 months ago
- The core modules and the platform☆36Updated last year
- Secure REST service to index, search, retrieve and aggregate content from heterogeneous sources.☆20Updated 9 months ago
- everREST project it is RESTful application framework along with complete JAX-RS (JSR-311) implementation☆27Updated 2 years ago
- ☆14Updated 2 months ago
- Core API for Silverpeas☆50Updated last week
- Very basic web app project that grabs a twitter stream and runs it through Stanfords Core NLP☆10Updated 9 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆29Updated 2 years ago
- PredictionIO E-Commerce Recommendation Engine Template (Java-based parallelized engine)☆39Updated 6 years ago
- Aggregated SwitchYard repository☆12Updated 7 years ago
- Application transformation tool☆50Updated 3 weeks ago
- Detect memory leaks in minutes without a heap dump.☆17Updated 8 years ago
- Live Demonstrations of Java Performance Problems. Instructions: https://github.com/eostermueller/javaPerformanceTroubleshooting/wiki/In…☆20Updated 3 years ago
- Mirror of Apache MetaModel Membrane☆16Updated 6 years ago
- Common libraries for adding 'nosql' style analytics to your application☆13Updated 3 years ago
- mustache templates, used to generate RESTful API Document with the help of Swagger☆28Updated 7 years ago
- Crafter Studio authoring environment.☆24Updated this week
- A HTTP cache for Java☆29Updated 3 years ago
- Plexus Utils☆35Updated last month
- Mirror of Apache Maven pom. Repo retired, see https://github.com/apache/maven-parent and https://github.com/apache/maven-apache-parent☆16Updated 6 years ago
- Uberfire Framework☆83Updated 6 years ago
- Provides first-class support for running Spring / Spring Boot applications as cloud functions inside “server less” architectures, such as…☆8Updated 8 years ago
- Plivo Java helper Library☆35Updated 2 months ago
- Mirror of Apache NiFi to support ongoing MarkLogic integration efforts☆13Updated last month
- Java-based library to statistically characterize and randomly generate strings.☆9Updated 7 years ago