forcedotcom / SiteCrawler
This is a Java library which can be used to crawl the content of some of web properties (www.salesforce.com, blogs.salesforce.com for example). It supports dynamic scaling (depending on available machine power (CPU, RAM) and network capacity) out of the box. It also has a Plugin structure, which allows others to write code (plugins) that act on …
☆22Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for SiteCrawler
- Simplified scalable aggregation and processing framework built upon Apache Camel.☆22Updated 6 years ago
- ORM for nosql with SCALABLE SQL☆76Updated 10 years ago
- Live Demonstrations of Java Performance Problems. Instructions: https://github.com/eostermueller/javaPerformanceTroubleshooting/wiki/In…☆20Updated 2 years ago
- UTAM Java implementation☆26Updated this week
- Data abstraction, storage, discovery, and serving system☆31Updated last month
- Java-based library to statistically characterize and randomly generate strings.☆9Updated 6 years ago
- PredictionIO E-Commerce Recommendation Engine Template (Java-based parallelized engine)☆39Updated 5 years ago
- Agorava Core API and Implentations☆50Updated 3 years ago
- Platform to build API applications that have to aggregate data from distributed services in an efficient way.☆22Updated 10 months ago
- Metro has been contributed to Eclipse Foundation. This repository is for legacy review only. Please refer to the Eclipse EE4J Metro proje…☆10Updated 5 years ago
- PowerAuth Server component is the back-end counterpart of PowerAuth Mobile SDK that holds device registrations and verifies MFA signature…☆18Updated this week
- Core API for Silverpeas☆49Updated this week
- Spring Boot API using OrientDB as Database Management System: Document DB☆10Updated 6 years ago
- Distributed processing framework for search solutions☆81Updated last year
- Talend Component Kit (implementation repository)☆32Updated this week
- The core modules and the platform☆36Updated last year
- Java client library for Kill Bill☆33Updated 3 weeks ago
- Lucene plugin for indexing and searching files stored in Baratine distributed filesystem☆16Updated 8 years ago
- Extensible Formula Parser Engine with a Java, SQL, and Javascript execution engine.☆51Updated 4 months ago
- Distributed Elastic Message Processing System☆195Updated 11 months ago
- Detect memory leaks in minutes without a heap dump.☆17Updated 7 years ago
- Apache Maven Doxia base☆28Updated this week
- ☆49Updated 2 years ago
- Crafter Studio authoring environment.☆23Updated this week
- Hackergarden conference app☆13Updated 5 years ago