ljvmiranda921 / prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
☆199Updated 2 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe:
Users who are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆212Updated last month
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated last year
- NeatText: a simple NLP package for cleaning textual data and text preprocessing☆71Updated last year
- All the go-to functions you need to handle NLP use cases, integrated in NLPretext☆140Updated 11 months ago
- Streamline scikit-learn model comparison.☆146Updated 2 years ago
- just a bunch of useful embeddings for scikit-learn pipelines☆482Updated last month
- spaCy NER annotator using ipywidgets☆119Updated 11 months ago
- A Simple Bulk Labelling Tool☆567Updated 2 months ago
- Quote extraction for modular journalism (JournalismAI collab 2021)☆227Updated 3 years ago
- Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Gain clues from clustering!☆312Updated 7 months ago
- A comprehensive reference for all topics related to building and maintaining microservices☆67Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark.☆90Updated last month
- A Python library for calculating a large variety of metrics from text☆324Updated 2 months ago
- A data labelling tool based on Streamlit.☆23Updated 3 years ago
- Few-shot Named Entity Recognition☆123Updated 2 years ago
- SpanMarker for Named Entity Recognition☆419Updated last month
- 💥 Explosion Assets☆43Updated last year
- ☆81Updated 2 years ago
- A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆242Updated 9 months ago
- Label data using HuggingFace's transformers and automatically get a prediction service☆183Updated last year
- Doubt your data, find bad labels.☆508Updated 7 months ago
- Mastering spaCy, published by Packt☆129Updated last year
- A comprehensive tool for linguistic analysis of communities☆49Updated 3 years ago
- Super Simple Similarities Service☆142Updated last year
- Nesta's Skills Extractor Library☆127Updated 4 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago