chrismattmann / trec-dd-polar
A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research.
☆13Updated 8 years ago
Alternatives and similar repositories for trec-dd-polar
Users who are interested in trec-dd-polar are comparing it to the libraries listed below.
- Uses Apache Lucene, OpenNLP, and GeoNames to extract locations from text and geocode them.☆38Updated last year
- Mirror of Apache Stanbol (incubating)☆114Updated last year
- NEXUS is an emerging data-intensive analysis framework developed with a new approach for handling science data that enables large-scale d…☆21Updated 2 years ago
- For interacting with nutch via Python☆29Updated last week
- Scientific Spark - a NASA AIST14 project☆86Updated 7 years ago
- Hadoop integration code for working with Apache cTAKES☆10Updated 11 years ago
- Vizlinc☆15Updated 9 years ago
- Apache OpenNLP Sandbox☆44Updated this week
- Sparql -> SQL Rewriter enabling virtual RDB -> RDF mappings☆131Updated last year
- Apache Commons RDF☆51Updated this week
- Apache Anything To Triples (Any23) is a library, a web service and a command line tool that extracts structured data in RDF format from a…☆98Updated 2 years ago
- Trending on Accumulo☆40Updated 13 years ago
- Nutch-Python is a Python binding to the Apache Nutch™ REST services allowing Nutch to be called natively in the Python community.☆39Updated 9 years ago
- Wings workflow system☆50Updated last year
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆18Updated last year
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- Common Java interfaces for RDF-1.1 libraries, now in Apache Incubator☆29Updated 5 years ago
- Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.☆108Updated 7 months ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- SKOS analysis for Elasticsearch☆54Updated 9 years ago
- Publishing Big Geospatial data as Linked Open Geospatial Data☆40Updated last year
- Minimal Viable Identifier☆14Updated 3 years ago
- Mirror of Apache Rya☆113Updated 2 years ago
- This project describes the D4M 2.0 Schema used in many Accumulo systems.☆21Updated 5 years ago
- Mirror of Apache OODT☆65Updated 2 years ago
- SIREn - Semi-Structured Information Retrieval Engine☆108Updated 4 years ago
- Codemeta paper.☆10Updated 8 years ago
- Graphulo: Accumulo library of matrix math primitives and graph algorithms☆79Updated 3 months ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Earth Science Knowledge Graph - An Automatic Approach to Building Earth Science Knowledge Graph to Improve Data Discovery☆20Updated 3 years ago