SkBlaz / rakun2
RaKUn 2.0 - A fast keyword detection algorithm
☆65, updated this week
Alternatives and similar repositories for rakun2:
Users who are interested in rakun2 are comparing it to the libraries listed below:
- A Python library aimed at dissecting and augmenting NER training data. (☆58, updated last year)
- SpaCy wrapper for ConceptNet (☆89, updated last year)
- Few-shot Named Entity Recognition (☆123, updated 2 years ago)
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. (☆105, updated 10 months ago)
- Explainable Zero-Shot Topic Extraction (☆62, updated 6 months ago)
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging (☆65, updated 2 years ago)
- A spaCy custom component that extracts and normalizes temporal expressions (☆54, updated 2 years ago)
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. (☆104, updated 9 months ago)
- Code accompanying the submission "Structural Text Segmentation of Legal Documents" by Aumiller et al. (☆96, updated last year)
- Source code and data for Like a Good Nearest Neighbor (☆28, updated last month)
- Augmenty is an augmentation library based on spaCy for augmenting texts. (☆151, updated 8 months ago)
- (☆84, updated 5 months ago)
- (☆155, updated 8 months ago)
- Generalist and Lightweight Model for Text Classification (☆79, updated this week)
- Creating class-based TF-IDF matrices (☆82, updated 2 years ago)
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models (☆65, updated 2 years ago)
- (☆42, updated last year)
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … (☆106, updated 11 months ago)
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… (☆40, updated 3 years ago)
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… (☆46, updated 10 months ago)
- 🧪 Cutting-edge experimental spaCy components and features (☆96, updated 9 months ago)
- Mining Legal Arguments in Court Decisions - Data and software (☆66, updated last year)
- Vespa application making an index of the CORD-19 dataset. (☆39, updated last month)
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… (☆86, updated 11 months ago)
- A spaCy wrapper for DBpedia Spotlight (☆108, updated last year)
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures. (☆32, updated 9 months ago)
- Data programming by demonstration for information extraction and span annotation (☆35, updated 3 years ago)
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. (☆76, updated last year)
- Information extraction from English and German texts based on predicate logic (☆135, updated last year)
- Open source library for few shot NLP (☆77, updated last year)