SkBlaz / rakun2
RaKUn 2.0 - A fast keyword detection algorithm
β66Updated last month
Alternatives and similar repositories for rakun2:
Users that are interested in rakun2 are comparing it to the libraries listed below
- π« SpaCy wrapper for ConceptNet π«β90Updated last year
- β84Updated 6 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 10 months ago
- Source code and data for Like a Good Nearest Neighborβ28Updated 2 months ago
- Few-shot Named Entity Recognitionβ123Updated 2 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' puβ¦β40Updated 3 years ago
- β43Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.β105Updated 11 months ago
- Explainable Zero-Shot Topic Extractionβ62Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data.β58Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.β106Updated 10 months ago
- Semantically Structured Sentence Embeddingsβ65Updated 5 months ago
- Open source library for few shot NLPβ77Updated last year
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β86Updated 2 months ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.β44Updated 10 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β78Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrievalβ29Updated 2 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.β37Updated 3 years ago
- Creating class-based TF-IDF matricesβ83Updated 2 years ago
- Vespa application making an index of the CORD-19 dataset.β39Updated 2 months ago
- KeyPhraseTransformer lets you quickly extract key phrases, topics, themes from your text data with T5 transformer | Keyphrase extractionβ¦β104Updated 9 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β80Updated 8 months ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Robust and fast topic models with sentence-transformers.β48Updated this week
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 qβ¦β87Updated last year
- π§ͺ Cutting-edge experimental spaCy components and featuresβ97Updated 11 months ago
- A Dataset for Direct Quotation Extraction and Attribution in News Articles.β13Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further langβ¦β121Updated 11 months ago
- Using short models to classify long textsβ21Updated 2 years ago