ljvmiranda921 / prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
☆198 Updated 2 years ago
Related projects
Alternatives and complementary repositories for prodigy-pdf-custom-recipe
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆242 Updated last year
- 💥 Explosion Assets ☆42 Updated 11 months ago
- Spacy NER annotator using ipywidgets ☆121 Updated 7 months ago
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆226 Updated 2 years ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark. ☆89 Updated last week
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 8 months ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network ☆85 Updated 2 years ago
- Gain clues from clustering! ☆305 Updated 4 months ago
- just a bunch of useful embeddings ☆466 Updated 2 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆398 Updated 3 years ago
- A Simple Bulk Labelling Tool ☆552 Updated 2 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆68 Updated 11 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆320 Updated last year
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆209 Updated 5 months ago
- Streamline scikit-learn model comparison. ☆146 Updated last year
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆139 Updated 7 months ago
- A comprehensive reference for all topics related to building and maintaining microservices ☆67 Updated last year
- A Python library for calculating a large variety of metrics from text ☆315 Updated last month
- A python library for extracting text from PDFs without losing the formatting of the PDF content. ☆73 Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ☆243 Updated 6 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated last year
- ☆81 Updated 2 years ago
- Explainable Zero-Shot Topic Extraction ☆61 Updated 3 months ago
- SpanMarker for Named Entity Recognition ☆401 Updated 3 months ago
- A simple component to display annotated text in Streamlit apps. ☆523 Updated last month
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆153 Updated 2 years ago
- For pyvis and networkx ☆79 Updated last year
- Information extraction from English and German texts based on predicate logic ☆135 Updated last year
- Few-shot Named Entity Recognition ☆122 Updated 2 years ago
- Text analysis with networks. ☆285 Updated 6 months ago