berkmancenter / corpusbuilderLinks
Corpus Build OCR platform
☆8Updated 2 years ago
Alternatives and similar repositories for corpusbuilder
Users that are interested in corpusbuilder are comparing it to the libraries listed below
Sorting:
- Visual analytics application for qualitative text analysis☆24Updated 2 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018).☆14Updated 6 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- Wrapper around pixel classifier☆9Updated 3 years ago
- ☆12Updated 7 years ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- ☆40Updated 7 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets☆14Updated 5 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago
- Relational NLP: Convert text into relational facts.☆9Updated 5 years ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆63Updated last year
- Python tools for text☆15Updated 5 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps☆17Updated 3 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim☆19Updated 8 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- Recipes for training OpenNMT systems☆14Updated 7 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 7 years ago
- Exploring textual and social measures of distance between genres.☆15Updated 6 years ago
- Just charts. Really.☆22Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- ☆29Updated last year
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Finds linguistic patterns effortlessly☆36Updated last year
- Convert a corpus of PDF to clean text files on a distributed architecture☆39Updated last year
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- Disambiguating biomedical and clinical concepts with word embeddings☆14Updated 7 years ago