yahoo / tagchowderLinks
Parsing and extracting information from (possibly malformed) HTML/XML documents
☆10Updated last year
Alternatives and similar repositories for tagchowder
Users that are interested in tagchowder are comparing it to the libraries listed below
Sorting:
- ☆16Updated 8 years ago
- Solr Relevance Ranking Analysis and Visualization Tool☆15Updated 5 years ago
- Solr AutoComplete implementation☆59Updated 7 years ago
- Solr SearchComponent for altering and re-executing queries that product poor results☆14Updated 4 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆87Updated 8 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆47Updated 3 years ago
- Advanced desktop search/corpus exploration prototype☆21Updated 4 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆26Updated last month
- SKOS Support for Apache Lucene and Solr☆56Updated 4 years ago
- Zulia Search Engine☆33Updated this week
- Base modules of JCoRe☆22Updated last year
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 7 months ago
- Java library for reading and writing WARC files with a typed API☆50Updated last month
- SOLR bulk indexing utility for the command line.☆44Updated last month
- Common web archive utility code.☆56Updated last month
- Solr Redis Extensions☆53Updated last year
- Wandora is a general purpose information extraction, management and publishing application based on Topic Maps and Java.☆133Updated last year
- Core package of the Metafacture tool suite for metadata processing.☆73Updated this week
- Example SPARQL queries, mostly for working with ZBW data sets☆16Updated this week
- Javascript library to talk to multiple OLAP backends from multiple frontends☆17Updated 12 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆21Updated 3 years ago
- The YQL+ parser, execution engine, and source SDK.☆42Updated 2 years ago
- This is the frontend layer of SearchX. SearchX is a scalable collaborative search system being developed by Lambda Lab of TU Delft.☆15Updated 2 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆115Updated last week
- TeXoo – A Zoo of Text Extractors☆18Updated 5 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 7 years ago
- Simple RESTful API server running your own machine translation model. Docker image modified from mbartoli/easy-smt☆11Updated 6 years ago
- Ergonomic line-by-line transcription of scanned text.☆53Updated 4 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21Updated 7 years ago