diffbot / open-source-search-engine
An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructions!!!
☆27 Updated 7 years ago
Alternatives and similar repositories for open-source-search-engine
Users that are interested in open-source-search-engine are comparing it to the libraries listed below
- Raw Wikipedia counts for entity linking ☆19 Updated 8 years ago
- ☆44 Updated 10 years ago
- A pipeline for crawling of RSS feeds and the associated content. Demo at newsfeed.ijs.si. ☆20 Updated 13 years ago
- Lightweight, multilingual natural language processing ☆63 Updated 12 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 10 years ago
- UMBEL (Upper Mapping and Binding Exchange Layer) ☆102 Updated 2 years ago
- Specification of NAF, the NLP annotation format ☆21 Updated 5 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆47 Updated 4 years ago
- Topic modeling web application ☆40 Updated 10 years ago
- Semanticizest: dump parser and client ☆20 Updated 9 years ago
- ☆21 Updated 7 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It … ☆15 Updated 9 years ago
- JavaScript based graph visualization library with emphasis on customization and modularity. ☆13 Updated 6 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool ☆149 Updated 4 years ago
- Actor Network Text Analyser ☆57 Updated 11 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j ☆101 Updated 12 years ago
- Wikipedia Live Monitor ☆22 Updated last year
- Imports DBPedia dumps into Neo4j ☆32 Updated 8 years ago
- A repo that contains outgoing links from DBpedia ☆49 Updated 5 years ago
- (BROKEN, help wanted) ☆15 Updated 9 years ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆58 Updated 2 years ago
- an open-source data management platform for knowledge workers (https://github.com/dswarm/dswarm-documentation/wiki) ☆54 Updated 8 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆34 Updated 4 years ago
- The Open Semantic Framework (OSF) is a generally RESTful middleware API layer that provides the bridge between existing content, structur… ☆49 Updated 8 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text ☆35 Updated 9 years ago
- ☆13 Updated 10 years ago
- DKPro WSD: A Java framework for word sense disambiguation ☆20 Updated 3 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings) ☆18 Updated 11 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 Updated 13 years ago
- Turning news into events since 2014. ☆51 Updated 8 years ago