yahoo / tagchowder
Parsing and extracting information from (possibly malformed) HTML/XML documents
☆9Updated 9 months ago
Alternatives and similar repositories for tagchowder:
Users that are interested in tagchowder are comparing it to the libraries listed below
- Javascript library to talk to multiple OLAP backends from multiple frontends☆18Updated 12 years ago
- Visualization of result returning by Solr 6 graph query☆10Updated 8 years ago
- Web/FileSystem Crawler Library☆29Updated this week
- Mirror of Apache OpenNLP Add-ons☆17Updated 2 weeks ago
- A smart distributed crawler that infers navigation models of structured websites, used to cluster pages based on their structure and extr…☆8Updated 3 years ago
- Generic library shared between several projects.☆13Updated 2 weeks ago
- Solr Relevance Ranking Analysis and Visualization Tool☆17Updated 5 years ago
- Distributed processing framework for search solutions☆81Updated 2 years ago
- Simplified scalable aggregation and processing framework built upon Apache Camel.☆22Updated 6 years ago
- Twitter sentiment analysis using Spark and Stanford CoreNLP and visualization using elasticsearch and kibana☆20Updated 7 years ago
- This plugin provides a useful feature for multi-language☆14Updated 2 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- A library to store metadata of relational databases including the schema, statistics, and integrity constraints.☆25Updated 6 years ago
- Europeana Cloud is Europeana’s new cloud-based infrastructure for storing and sharing cultural heritage data. It is currently in internal…☆26Updated last week
- Windows installer for Groovy☆12Updated 3 years ago
- HTML Form Entry module☆41Updated last week
- Planning feature for Superdesk☆11Updated this week
- Java library for Concrete, a data serialization format for NLP☆6Updated 5 years ago
- This is a Java library which can be used to crawl the content of some of web properties (www.salesforce.com, blogs.salesforce.com for exa…☆22Updated 3 years ago
- A library to implement event-sourcing microservices☆16Updated this week
- A tool that takes an image based content article and automatically generates a motion video out of it.☆20Updated 2 years ago
- Code and Data Samples for Big Data Warehousing.☆10Updated 9 years ago
- An HTTP proxy for Elasticsearch, Solr (etc.) to prevent a 100% full disk situation.☆11Updated 6 years ago
- Java Sketch Characterization Code.☆11Updated 2 weeks ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- fuzzydb is a fuzzy matching database engine capable of providing human-like search results that make life much easier for users of websit…☆19Updated last year
- ☆9Updated 9 years ago
- Plugin to push elasticsearch data to newrelic☆44Updated 11 years ago
- Building recommenders with Elastic Graph!☆37Updated 4 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Updated 8 years ago