openredact / nerwhalLinks
This is a prototype of a multi-lingual suite for named-entity recognition in Python.
☆21Updated last year
Alternatives and similar repositories for nerwhal
Users that are interested in nerwhal are comparing it to the libraries listed below
Sorting:
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45Updated last year
- Combining encoder-based language models☆11Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Custom Natural Language Processing with big and small models 🌲🌱☆68Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 9 months ago
- ☆43Updated 2 years ago
- ☆30Updated 3 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Updated 6 months ago
- Data Programming by Demonstration (DPBD) for Document Classification☆35Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Updated 4 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- A text augmentation tool for named entity recognition.☆54Updated 4 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 5 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Updated 10 months ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 5 years ago
- A python package to simulate typographical errors.☆37Updated last year
- Repository for the paper "Named Entity Recognition for Entity Linking: What Works and What's Next" (EMNLP 2021).☆76Updated 3 years ago
- Model for predicting categories of entities by its mentions☆29Updated 4 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆21Updated 2 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 3 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 4 years ago
- A collection of selected of models built with AllenNLP.☆25Updated 5 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 4 years ago