diffbot / open-source-search-engine
An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructions!!!
☆23 · Updated 6 years ago
Related projects
Alternatives and complementary repositories for open-source-search-engine
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner ☆41 · Updated 2 years ago
- Elwha is a Java application for monitoring topics, sentiment and events on Twitter streams with the ability to generate notification mess… ☆14 · Updated 9 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri ☆46 · Updated 2 years ago
- Extract statistics from Wikipedia Dump files. ☆26 · Updated 3 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j ☆102 · Updated 11 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi… ☆14 · Updated 10 years ago
- Raw Wikipedia counts for entity linking ☆19 · Updated 7 years ago
- A toolkit for clustering web pages based on various similarity measures. ☆32 · Updated 3 years ago
- Traptor -- A distributed Twitter feed ☆26 · Updated 2 years ago
- D3 and Play based visualization for entity-relation graphs, especially for NLP and information extraction ☆29 · Updated 9 years ago
- Scraper built with Scrapy. ☆14 · Updated 3 months ago
- Specification of NAF, the NLP annotation format ☆21 · Updated 3 years ago
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3 ☆14 · Updated 10 years ago
- Pollster polls for share counts of URLs at regular intervals. ☆47 · Updated 9 years ago
- Prototype plugin to support topic modeling using LDA in Elasticsearch ☆20 · Updated 8 years ago
- Google Refine extension for adding columns (extending data) from DBpedia ☆39 · Updated 11 years ago
- A pipeline for crawling of RSS feeds and the associated content. Demo at newsfeed.ijs.si. ☆21 · Updated 12 years ago
- Pattern-of-Behavior Search Tool ☆11 · Updated 2 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 · Updated 8 years ago
- Lightweight, multilingual natural language processing ☆63 · Updated 11 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts ☆59 · Updated 12 years ago
- Vizlinc ☆14 · Updated 8 years ago
- Virtual patent marking crawler at iproduct.epfl.ch ☆14 · Updated 7 years ago
- ☆13 · Updated 8 years ago
- ☆42 · Updated 8 years ago
- Events and Situations Ontology ☆13 · Updated 6 years ago
- Quickly analyze and explore email with advanced analytics and visualization. ☆55 · Updated 3 years ago
- Browser add-on and web server to support collection and analysis of web browsing data. ☆13 · Updated 8 years ago
- Implicit relation extractor using a natural language model. ☆25 · Updated 6 years ago