JHUAPL / PINELinks
Collaborative NLP annotation tool supporting enterprise authentication, inter-annotator statistics, active learning
☆14Updated 2 years ago
Alternatives and similar repositories for PINE
Users that are interested in PINE are comparing it to the libraries listed below
Sorting:
- PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolut…☆161Updated 3 years ago
- ☆70Updated 3 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+☆23Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 4 years ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed.☆13Updated 3 years ago
- Machine Learning for Information Retrieval☆86Updated 7 months ago
- TypeDB Driver Example Projects and Tutorials☆86Updated 2 months ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 6 months ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated 2 years ago
- DataHub - Synthetic data library☆81Updated 2 years ago
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆92Updated 4 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO).☆93Updated last year
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆191Updated 2 years ago
- Record Linkage ToolKit (Find and link entities)☆111Updated 2 years ago
- Scalable String Similarity Joins in Python☆39Updated last year
- Fuzzy matching and more functionality for spaCy.☆259Updated last year
- Python implementation of anonymous linkage using cryptographic linkage keys☆70Updated last year
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 5 years ago
- Library for streaming data and incremental learning algorithms.☆26Updated 3 months ago
- Template for AC297r projects☆33Updated 5 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim