berkmancenter / corpusbuilder
Corpus Build OCR platform
☆8 Updated 2 years ago
Alternatives and similar repositories for corpusbuilder
Users interested in corpusbuilder are comparing it to the libraries listed below.
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- Visual analytics application for qualitative text analysis ☆24 Updated 2 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- Extract knowledge from raw text ☆13 Updated 3 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆14 Updated 5 years ago
- Sidewall is a Python library for interacting with the Dimensions search API. ☆17 Updated 9 months ago
- Python tools for text ☆15 Updated 5 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 3 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated last week
- Various functions to make bag-of-words approaches to text analysis more user-friendly ☆24 Updated 8 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 Updated 2 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation ☆14 Updated 7 years ago
- ☆12 Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆38 Updated 6 years ago
- Simple CTC implementation for PyTorch ☆14 Updated 7 years ago
- A visual analysis tool for exploring multiverse outcomes ☆30 Updated 3 years ago
- ☆12 Updated 7 years ago
- ☆11 Updated 6 years ago
- R package for Multisource Embeddings for Medical Records ☆17 Updated 3 years ago
- ☆29 Updated last year
- Uncertainty-aware principal component analysis. ☆18 Updated 3 years ago
- Crawling and analyzing data on Wikipedia ☆17 Updated last year
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling… ☆15 Updated 2 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆18 Updated 10 months ago
- A browser extension providing Open Access bibliographical services ☆17 Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 Updated 2 years ago
- An interactive visual tool for exploring ideologies of political parties from up to date WikiData, using SPARQL, D3js, and PixiJS ☆16 Updated 3 years ago
- The OpenCitations RDF Resource Browser ☆14 Updated 3 weeks ago