fivesmallq / web-data-extractor
Extracting and parsing structured data with jQuery Selector, XPath or JsonPath from common web format like HTML, XML and JSON.
☆54Updated last year
Alternatives and similar repositories for web-data-extractor:
Users that are interested in web-data-extractor are comparing it to the libraries listed below
- Java agent string parser based on Udger https://udger.com/products/local_parser☆26Updated last year
- A new solr multilingual index and search architecture, it can support index and search across multiple languages at the same time in the …☆13Updated 5 years ago
- 一款基于Java注解的elasticsearch mapping生成工具,支持ES 5.2.0所有可选参数☆21Updated 7 years ago
- Web/FileSystem Crawler Library☆29Updated 3 weeks ago
- Neuro4j Workflow is a light-weight workflow engine for Java with Eclipse-based development environment. Workflow allows to build reusable…☆60Updated 5 years ago
- java puppeteer☆16Updated 6 months ago
- (Deprecated) Compile-time transformer to run Groovy code in a restrictive sandbox☆125Updated 2 months ago
- Implementation of the new headless chrome with chromedriver and selenium.☆38Updated 5 years ago
- Scheduler for Elasticsearch plugins☆17Updated 10 years ago
- Concentrated on solving java components conflict problem!☆25Updated 2 years ago
- 对spring中使用mybatis做了增强,提供读写分离,配置热加载,基础CRUD操作等支持☆19Updated 9 years ago
- Open Source ETL designed for and dedicated to Log processing and transformation☆68Updated 2 years ago
- collect free proxies, and check their availability☆23Updated 7 years ago
- dyanmic spring bean,controller/entity/dao/propertyEditor/manager(service)/interceptor bean compliled directly from java file real time li…☆11Updated 9 years ago
- A library enabling DAG structuring of data processing programs such as ETLs☆16Updated 2 weeks ago
- 蜜蜂牧场是一个数据采集清洗工具,也是一个ETL工具,同时也是一套脚本语言。☆13Updated 6 years ago
- Provides simplified access to the ElasticSearch Java API.☆4Updated 3 years ago
- A modern WebSite and Web services framework with built in Async events, HTTP server,client and WebSocket server,client, all powered by Ne…☆70Updated last year
- Lightweight embedded java full text search engine☆13Updated 4 years ago
- Java 编写的 SSDB 客户端,支持负载均衡☆55Updated 3 months ago
- Single file examples and ready-to-use servers show how to use parallec.io library. Examples to aggregate APIs and publish to Elastic Sear…☆92Updated 7 years ago
- Parsing SQL to MongoDB☆26Updated 4 years ago
- ☆11Updated 5 years ago
- 从网上找到的一个将one CMDB代码用maven的 项目.便于代码的二次开发.感谢konca.☆18Updated 12 years ago
- simple implementation of arthas☆9Updated 2 years ago
- a simple performance test framework for java, 一个简单的Java性能测试框架☆22Updated last year
- Metrics + Hyperic Sigar for OS-level monitoring☆77Updated 10 years ago
- convert excel rows to javabeans and vice visa☆19Updated last year
- rules-engine, based on easy-rules☆33Updated 5 years ago
- Solr Redis Extensions☆52Updated 11 months ago