ljvmiranda921 / prodigy-pdf-custom-recipe
Custom recipe and utilities for document processing
☆199 Updated 3 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe
Users who are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below:
- Quote extraction for modular journalism (JournalismAI collab 2021) ☆230 Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti… ☆245 Updated 2 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing ☆72 Updated last year
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network ☆86 Updated 3 years ago
- Public runnable examples of using John Snow Labs' OCR for Apache Spark. ☆93 Updated last week
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆140 Updated 5 months ago
- Gain clues from clustering! ☆318 Updated last year
- Streamline scikit-learn model comparison. ☆143 Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ☆243 Updated last year
- A comprehensive reference for all topics related to building and maintaining microservices ☆67 Updated 2 years ago
- Spacy NER annotator using ipywidgets ☆123 Updated last year
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine ☆242 Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s… ☆219 Updated 7 months ago
- Explainable Zero-Shot Topic Extraction ☆63 Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated last year
- Mastering spaCy, published by Packt ☆136 Updated last week
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆399 Updated 4 years ago
- 100 applications built with H2O Wave ☆99 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- A data labelling tool based on Streamlit. ☆23 Updated 4 years ago
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆187 Updated last year
- Build Low Code Automated Tensorflow explainable models in just 3 lines of code. Library created by: Hasan Rafiq - https://www.linkedin.co… ☆181 Updated 2 years ago
- A comprehensive tool for linguistic analysis of communities ☆49 Updated 3 years ago
- Information extraction from English and German texts based on predicate logic ☆138 Updated 2 years ago
- 🔔 No need to keep checking your training - just one import line and you'll know the second it's done. ☆344 Updated 3 years ago
- A python library for extracting text from PDFs without losing the formatting of the PDF content. ☆77 Updated 3 years ago
- Text analysis with networks. ☆288 Updated 5 months ago
- Build tensorflow keras model pipelines in a single line of code. Now with mlflow tracking. Created by Ram Seshadri. Collaborators welcome… ☆121 Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆323 Updated 2 years ago