oduwsdl / QueryClassification
Source code for domain classification (scholar or non-scholar) of a web query.
☆11Updated 8 years ago
Alternatives and similar repositories for QueryClassification:
Users that are interested in QueryClassification are comparing it to the libraries listed below
- ☆16Updated 9 years ago
- Collaborative collection development for web archives☆18Updated 5 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆43Updated 7 years ago
- This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections.☆18Updated 9 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 2 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 7 years ago
- Topic Modeling Workflow in Python☆16Updated last year
- Rails application for the Archives Unleashed Cloud.☆11Updated 3 years ago
- WASAPI data transfer APIs☆43Updated 2 years ago
- CI scripts for validating and processing metadata☆11Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Efficient indexing and retrieval of OCR bounding boxes in Solr☆22Updated 5 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Updated 5 years ago
- Bibliographic Entity Automatic Recognition and Disambiguation☆65Updated 4 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- Text-Induced Corpus Clean-up☆20Updated last year
- Python package for harvesting records from OAI-PMH provider(s).☆62Updated 2 years ago
- No longer maintained. Please use conciliator instead.☆26Updated 4 years ago
- rightsstatements.org data model☆12Updated 2 years ago
- Lakesuperior, an alternative Fedora Repository implementation☆32Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆113Updated 8 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- A collection of ipython/jupyter notebooks☆16Updated 6 years ago
- Archive Research Services Workshop☆31Updated 7 years ago
- DoSeR with entity disambiguation components only☆16Updated 6 years ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆36Updated 9 months ago
- Tools to analyze web archives☆20Updated 8 years ago
- Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP)☆36Updated 9 years ago