berkmancenter / corpusbuilder
Corpus Build OCR platform
☆8Updated 2 years ago
Alternatives and similar repositories for corpusbuilder:
Users that are interested in corpusbuilder are comparing it to the libraries listed below
- Visual analytics application for qualitative text analysis☆24Updated 2 years ago
- Sidewall is a Python library for interacting with the Dimensions search API.☆17Updated 7 months ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 4 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- Wrapper around pixel classifier☆9Updated 3 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- Python tools for text☆15Updated 4 years ago
- ☆12Updated 2 years ago
- ☆11Updated 5 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- ☆10Updated 5 years ago
- javascript multivariate data visualization☆14Updated 8 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆16Updated last year
- Promoss Topic Modelling Toolbox☆11Updated 6 years ago
- Statistical visualizations for Datasette using Seaborn☆12Updated 3 years ago
- A visual analysis tool for exploring multiverse outcomes☆30Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Examples of bad data, especially from government.☆23Updated 8 months ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 4 years ago
- Python API for KB data-services☆19Updated 5 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- ☆12Updated last year
- Just charts. Really.☆22Updated last year
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Uncertainty-aware principal component analysis.☆17Updated 3 years ago
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- CoreNLG is an easy to use and productivity oriented Python library for Natural Language Generation. It aims to provide the essential tool…☆27Updated 3 years ago