diffbot / open-source-search-engine
An open source search engine written in C/C++ for Linux on Intel/AMD. From gigablast dot com. See the README.md file below for instructions!!!
☆23Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for open-source-search-engine
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner☆41Updated 2 years ago
- Automatic tagging and analysis of documents in an Apache Solr index for faceted search by RDF(S) Ontologies & SKOS thesauri☆46Updated 2 years ago
- A POC at replicating Facebook Graph Search with Cypher and Neo4j☆102Updated 11 years ago
- Exploits Wikipedia's daily view counts to find out what topics are current trends☆17Updated 11 years ago
- This is a set of ontologies used by different parts of the Open Semantic Framework. These ontologies should normally be loaded in OSF usi…☆14Updated 10 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆34Updated 8 years ago
- Traptor -- A distributed Twitter feed☆26Updated 2 years ago
- A Relaxed Schema Graph Database Management System☆53Updated 4 years ago
- Lightweight, multilingual natural language processing☆63Updated 11 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- A simple Web crawler for stackshare.io using scrapy .☆9Updated 5 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated last month
- Mirror of Apache Stanbol (incubating)☆112Updated 8 months ago
- Models and serializers for ontologies and related artifacts backed by 4store☆18Updated 2 weeks ago
- Scraper built with Scrapy.☆14Updated 2 months ago
- Web-based synthesis of nifty NLP and entity extraction services☆13Updated 5 years ago
- Semanticizest: dump parser and client☆20Updated 8 years ago
- D3 and Play based visualization for entity-relation graphs, especially for NLP and information extraction☆29Updated 9 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆23Updated 8 years ago
- Collects multimedia content shared through social networks.☆19Updated 9 years ago
- The inpho model and dataprocessing tools. Interface between codex and inphosite☆18Updated 4 years ago
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 8 years ago
- A repo that contains outgoing links from DBpedia☆50Updated 4 years ago
- Browser version of Hyphe (WIP)☆29Updated 3 weeks ago
- PredictionIO word2vec engine template (Scala-based parallelized engine)☆12Updated 9 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆112Updated 8 years ago