diffbot / open-source-search-engine
An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructions!!!
☆25Updated 6 years ago
Alternatives and similar repositories for open-source-search-engine:
Users that are interested in open-source-search-engine are comparing it to the libraries listed below
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner☆41Updated 2 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Scraper built with Scrapy.☆14Updated 5 months ago
- Pollster polls for share counts of URLs at regular intervals.☆47Updated 9 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆46Updated 3 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi…☆14Updated 10 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- LINKED DATA QUALITY REPORTS☆41Updated 2 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- JSON schemas for OpenCorporates data☆19Updated 7 months ago
- RESTful API around the PETRARCH coding software☆10Updated 3 years ago
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 8 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- OntoWiki component to visualize RDF-DataCubes☆34Updated 8 years ago
- The inpho model and dataprocessing tools. Interface between codex and inphosite☆18Updated 5 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- Imports DBPedia dumps into Neo4j☆31Updated 7 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆53Updated 6 months ago
- A Relaxed Schema Graph Database Management System☆52Updated 4 years ago
- Jupyter Notebooks presenting Frictionless Data.☆9Updated 3 years ago
- Alignment, a collaborative, system aided, user driven ontology/vocabulary matching and validation platform.☆12Updated 2 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki)☆54Updated 7 years ago
- Mirror of Apache Stanbol (incubating)☆112Updated 10 months ago