NationalLibraryOfNorway / NB-N-gram
A trend viewer written in Python/JavaScript
☆21Updated 8 months ago
Alternatives and similar repositories for NB-N-gram
Users who are interested in NB-N-gram are comparing it to the repositories listed below
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 4 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- A design prototype for DocNow to learn with☆14Updated 8 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 7 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆109Updated 10 years ago
- System for building, visualizing, and working with LDA topic models☆97Updated 3 weeks ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 3 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆109Updated 4 years ago
- a CLI suggestion tool for Wikidata entities☆30Updated 8 years ago
- Detect and visualize text reuse☆118Updated 11 months ago
- The oaipmh module is a Python implementation of an "Open Archives Initiative Protocol for Metadata Harvesting"☆87Updated 2 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements.☆116Updated 9 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Basic dataset for the linguistic data collection.☆15Updated 8 years ago
- A textual corpus database for the digital humanities.☆61Updated 5 years ago
- This repository contains a tool and a collections dataset for detecting off-topic pages in Web archive collections.☆18Updated 9 years ago
- ☆16Updated 10 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated this week
- A simple configurable tool for manipulating dependency trees.☆14Updated 7 months ago
- NYT Risk Semantics Project☆12Updated 9 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆50Updated 2 years ago
- Entity Extraction Text Processor☆147Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago