berkmancenter / corpusbuilder
Corpus Build OCR platform
☆8 Updated 2 years ago
Alternatives and similar repositories for corpusbuilder:
Users who are interested in corpusbuilder are comparing it to the libraries listed below:
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 Updated 5 months ago
- Visual analytics application for qualitative text analysis ☆24 Updated 2 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- ☆12 Updated 5 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps ☆15 Updated 3 years ago
- Python tools for text ☆15 Updated 4 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆14 Updated 4 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly ☆24 Updated 7 years ago
- ☆17 Updated last week
- Statistical visualizations for Datasette using Seaborn ☆11 Updated 2 years ago
- A visual analysis tool for exploring multiverse outcomes ☆31 Updated 2 years ago
- A browser extension providing Open Access bibliographical services ☆14 Updated 2 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation ☆14 Updated 7 years ago
- Introduction to Topic Modeling for TextXD 2019, 12/3/2019 ☆10 Updated 5 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆15 Updated this week
- qdapTools is an R package that contains tools associated with the qdap package that may be useful outside of the context of text analysis… ☆16 Updated last year
- Uncertainty-aware principal component analysis. ☆17 Updated 3 years ago
- A curated list of awesome resources for COVID-19 ☆37 Updated 4 years ago
- Wrapper around pixel classifier ☆9 Updated 2 years ago
- A financial disclosure data extraction tool. ☆13 Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆37 Updated 5 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆15 Updated 5 years ago
- Berkeley DLab Python Intensive May 23-26 ☆27 Updated 8 years ago
- ☆18 Updated 4 years ago
- ☆12 Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 Updated 2 years ago