ljvmiranda921 / prodigy-pdf-custom-recipeLinks
Custom recipe and utilities for document processing
☆199Updated 2 years ago
Alternatives and similar repositories for prodigy-pdf-custom-recipe
Users that are interested in prodigy-pdf-custom-recipe are comparing it to the libraries listed below
Sorting:
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- Spacy NER annotator using ipywidgets☆123Updated last year
- Gain clues from clustering!☆314Updated 10 months ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆72Updated last year
- Public runnable examples of using John Snow Labs' OCR for Apache Spark.☆91Updated this week
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 4 months ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆85Updated 2 years ago
- A Simple Bulk Labelling Tool☆585Updated 5 months ago
- Quote extraction for modular journalism (JournalismAI collab 2021)☆229Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- just a bunch of useful embeddings for scikit-learn pipelines☆499Updated 2 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- A data labelling tool based on Streamlit.☆23Updated 3 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆140Updated 2 months ago
- Information extraction from English and German texts based on predicate logic☆136Updated last year
- Few-shot Named Entity Recognition☆123Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 9 months ago
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data…☆242Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- Expose a Top2Vec model with a REST API.☆90Updated 2 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆118Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 10 months ago
- 100 applications built with H2O Wave☆98Updated 2 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆155Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 6 months ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆322Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆221Updated 2 years ago