rlayers / pawpaw
Text Processing & Segmentation Framework
☆16Updated this week
Related projects ⓘ
Alternatives and complementary repositories for pawpaw
- Package to parse and analyze trademark data from the United States Patent and Trademark Office☆12Updated 7 years ago
- LEMON: Explainable Entity Matching☆18Updated 2 years ago
- Repository for "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks"☆24Updated last year
- Interpretable feature construction from taxonomies for text classification☆18Updated 2 years ago
- Collecting news articles for all the companies in the R1000, for a pre-defined set of news outlets, using Diffbot's Knowledge Graph☆10Updated last year
- ☆15Updated 3 years ago
- Gem to allow easy access to data from the WIPO PATENTSCOPE Web Service☆14Updated 3 years ago
- Datamallet is a python library which contains several helper functions and module for the common tasks in a typical data science workflow…☆11Updated 2 years ago
- A Temporal Networks Library written in Python☆12Updated 3 years ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re…☆12Updated 2 months ago
- A demo of the Mito Streamlit Spreadsheet☆16Updated last year
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 3 years ago
- pyspark-parallelised functions producing graph-theoretical metrics in connected component clusters for use in record-linkage (or other do…☆10Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 9 months ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆11Updated 6 months ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- A Python package for PME (Public Market Equivalent) calculation☆10Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆20Updated 8 months ago
- ☆13Updated 4 months ago
- Python package to access USPTO bulk data in rectangular format☆14Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Fast Python Vowpal Wabbit wrapper☆12Updated 3 years ago
- Personalization with deep learning in 100 lines of code☆14Updated last year
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Updated 3 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Updated 2 years ago
- The implementation of GTMF(Ground Truth Maturity Framework)☆18Updated 5 months ago
- A dashboard is worth a thousand words => https://datastudio.google.com/reporting/755f3183-dd44-4073-804e-9f7d3d993315☆27Updated 3 years ago