yahoo / tagchowder
Parsing and extracting information from (possibly malformed) HTML/XML documents
☆9Updated 4 months ago
Related projects: ⓘ
- Java scope functions inspired by Kotlin☆12Updated 6 months ago
- Zulia Search Engine☆29Updated this week
- Europeana Cloud is Europeana’s new cloud-based infrastructure for storing and sharing cultural heritage data. It is currently in internal…☆26Updated this week
- Base modules for ADAMS, the Advanced Data Mining and Machine Learning System.☆18Updated this week
- Web/FileSystem Crawler Library☆28Updated last month
- ☆10Updated this week
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆8Updated 3 years ago
- Prime MVC is a high performance Model View Controller framework built in Java.☆10Updated last week
- ☆10Updated 5 years ago
- Visualization of result returning by Solr 6 graph query☆10Updated 8 years ago
- Javascript library to talk to multiple OLAP backends from multiple frontends☆18Updated 11 years ago
- Apache Commons Testing☆8Updated 2 months ago
- Extensible Java Library for reading, manipulating and writing hierarchical data structures from/to various formats.☆14Updated 9 months ago
- Spring integration with Stardog RDF database☆17Updated 2 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆25Updated 2 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆24Updated 6 years ago
- Drop-in replacement for JVM maps when you need to optimize memory or work with large datasets☆12Updated 6 months ago
- Mirror of Apache PDFBox Docs☆26Updated 3 weeks ago
- ☆18Updated last month
- ☆16Updated 8 years ago
- Apache Commons JCI☆14Updated last week
- A java library for creating standalone, portable, schema-full object databases supporting pagination and faceted search, and offering str…☆16Updated 7 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services☆25Updated 3 months ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- Write JDBC ResultSet to Parquet File☆10Updated 2 weeks ago
- Student Success Plan - Open Source Software Project☆32Updated 7 months ago
- KNIME Deep Learning Integration☆22Updated last week
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 5 years ago
- Lexical categorization engine for large datasets. Good for NLP and Data Mining.☆104Updated 7 years ago
- Windows installer for Groovy☆12Updated 3 years ago