berkmancenter / corpusbuilder
Corpus Build OCR platform
β8Updated 2 years ago
Alternatives and similar repositories for corpusbuilder:
Users that are interested in corpusbuilder are comparing it to the libraries listed below
- MoodCatπΌ classifies the mood of English sentences.β14Updated 2 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).β14Updated 6 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modelingβ23Updated 4 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Allianceβ12Updated 8 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotationβ14Updated 7 years ago
- Visual analytics application for qualitative text analysisβ24Updated 2 years ago
- Wrapper around pixel classifierβ9Updated 3 years ago
- β11Updated 5 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.β15Updated 5 years ago
- Sidewall is a Python library for interacting with the Dimensions search API.β17Updated 6 months ago
- An example of how to use spaCy for extremely large files without running into memory issuesβ36Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)β25Updated 2 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendlyβ24Updated 7 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia descriptionβ¦β11Updated 2 years ago
- Introduction to Topic Modeling for TextXD 2019, 12/3/2019β10Updated 5 years ago
- R package for Multisource Embeddings for Medical Recordsβ17Updated 3 years ago
- Crawling and analyzing data on Wikipediaβ16Updated last year
- Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing mapsβ15Updated 3 years ago
- β12Updated 2 years ago
- Install OpenCV within Rβ12Updated 8 years ago
- Open Access PDF harvesterβ39Updated 10 months ago
- Finds linguistic patterns effortlesslyβ35Updated last year
- Implements the model described in "Identification, Interpretability, and Bayesian Word Embeddings"β18Updated 5 years ago
- β12Updated 11 months ago
- β16Updated 3 years ago
- Deep Learning Library for Rβ12Updated 6 years ago
- β10Updated 4 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasetsβ14Updated 4 years ago
- OCR-D post-correction module based on weighted finite-state transducersβ11Updated last year
- Relational NLP: Convert text into relational facts.β9Updated 5 years ago