ljvmiranda921 / prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
☆200 · Updated 3 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe
Users who are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below.
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆244 · Updated 2 years ago
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆229 · Updated 3 years ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark. ☆92 · Updated last week
- Gain clues from clustering! ☆318 · Updated last year
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network ☆86 · Updated 3 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆222 · Updated 3 years ago
- NeatText: a simple NLP package for cleaning textual data and text preprocessing ☆74 · Updated 2 years ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆244 · Updated 2 years ago
- A comprehensive reference for all topics related to building and maintaining microservices ☆67 · Updated 3 years ago
- Streamline scikit-learn model comparison. ☆143 · Updated 3 years ago
- Label data using HuggingFace's transformers and automatically get a prediction service ☆193 · Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆220 · Updated 11 months ago
- 100 applications built with H2O Wave ☆98 · Updated 3 years ago
- All the go-to functions you need to handle NLP use cases, integrated in NLPretext ☆142 · Updated 9 months ago
- A simple NLP library that allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ☆243 · Updated last year
- spaCy NER annotator using ipywidgets ☆124 · Updated last year
- Explainable Zero-Shot Topic Extraction ☆65 · Updated last year
- SpikeX - spaCy Pipes for Knowledge Extraction ☆400 · Updated 4 years ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆186 · Updated last year
- A comprehensive tool for linguistic analysis of communities ☆49 · Updated 4 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆323 · Updated 2 years ago
- ☆83 · Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 · Updated 3 years ago
- A Python library for extracting text from PDFs without losing the formatting of the PDF content. ☆79 · Updated 3 years ago
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy. ☆120 · Updated 2 months ago
- Information extraction from English and German texts based on predicate logic ☆139 · Updated 2 years ago
- Build low-code automated TensorFlow explainable models in just 3 lines of code. Library created by: Hasan Rafiq - https://www.linkedin.co… ☆180 · Updated 3 years ago
- A data labelling tool based on Streamlit. ☆23 · Updated 4 years ago
- Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆105 · Updated last year
- Few-shot Named Entity Recognition ☆122 · Updated 3 years ago