berkmancenter / corpusbuilder
Corpus Build OCR platform
☆8 Updated 2 years ago
Alternatives and similar repositories for corpusbuilder
Users who are interested in corpusbuilder are comparing it to the libraries listed below.
- Visual analytics application for qualitative text analysis ☆24 Updated 2 years ago
- ☆11 Updated 6 years ago
- Performs unique entity estimation corresponding to Chen, Shrivastava, Steorts (2018). ☆14 Updated 6 years ago
- ☆12 Updated 2 years ago
- Just charts. Really. ☆22 Updated last year
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆38 Updated 6 years ago
- A text processing pipeline for turning unstructured text data into hierarchical datasets ☆14 Updated 5 years ago
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼 ☆22 Updated 6 months ago
- Visualize a corpus of texts as a landscape with the aid of text mining, graph visualization and self-organizing maps ☆22 Updated 3 years ago
- Interactive scalable auditing of model biases and vulnerabilities with interpretable mitigation ☆24 Updated 3 years ago
- Scripts to take hand washing related text in (almost) any language and float it into a hand washing poster. ☆9 Updated 4 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 3 years ago
- Ricgraph - Research in context graph ☆29 Updated last week
- ☆10 Updated 5 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance ☆12 Updated 8 years ago
- Python wrapper for a C++ Double Metaphone ☆15 Updated 2 weeks ago
- Code supporting the dissertation "Agents in Conflict," George Mason University, 2016 ☆21 Updated 9 years ago
- Tools for working with book data ☆18 Updated last month
- Machine Learning-assisted correction of OCR errors in historical corpora ☆10 Updated 8 months ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models ☆16 Updated this week
- Extracting narrative timelines (i.e. order and timing of events) from text ☆20 Updated 6 years ago
- ☆15 Updated 3 years ago
- Topic supervised non-negative matrix factorization with sparse matrices ☆12 Updated 5 years ago
- A visual analysis tool for exploring multiverse outcomes ☆31 Updated 3 years ago
- ☆19 Updated last year
- Extract knowledge from raw text ☆13 Updated 3 years ago
- A text mining tool for developing visual and interactive relationship networks from PubMed article information. ☆15 Updated 11 months ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources ☆16 Updated 2 years ago