oduwsdl / QueryClassificationLinks
Source code for domain classification (scholar or non-scholar) of a web query.
☆11Updated 9 years ago
Alternatives and similar repositories for QueryClassification
Users that are interested in QueryClassification are comparing it to the libraries listed below
Sorting:
- ☆16Updated 10 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Updated 8 years ago
- This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections.☆18Updated 10 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Bibliographic Entity Automatic Recognition and Disambiguation☆65Updated 5 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- ☆18Updated 8 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- A design prototype for DocNow to learn with☆14Updated 8 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Updated 3 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed…☆155Updated 2 months ago
- Standalone Semanticizer☆32Updated 10 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆28Updated 4 years ago
- Version 1.0 of the CrowdTruth Framework for crowdsourcing ground truth data, for training and evaluation of cognitive computing systems. …☆60Updated 7 years ago
- Learning String Alignments for Entity Aliases☆37Updated 6 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆61Updated 7 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 9 years ago
- Tools for creating DBpedia Spotlight Lucene Index☆10Updated 3 years ago
- System for building, visualizing, and working with LDA topic models☆97Updated last week
- Warcbase is an open-source platform for managing analyzing web archives☆161Updated 8 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆131Updated last month
- Scripts and microservice to feed an ElasticSearch with Wikidata and Inventaire entities, and keep those up-to-date☆41Updated 5 years ago
- Temporal Expression Recognition and Normalisation in Python☆77Updated 9 years ago
- Outputs a list of ranked DBpedia resources for a search string.☆187Updated 4 years ago
- My implementation of Explicit Semantic Analysis (ESA) library that we used at KMi, Open University to produce our submission at the NTCIR…☆36Updated 10 years ago
- Knowledge extraction from web data☆92Updated 7 years ago