JYProjs / patentpy
Python package to access USPTO bulk data in rectangular format
☆15Updated 2 years ago
Alternatives and similar repositories for patentpy:
Users that are interested in patentpy are comparing it to the libraries listed below
- ☆31Updated this week
- ☆31Updated last year
- The Harvard USPTO Patent Dataset☆62Updated last year
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆30Updated 10 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Updated 2 years ago
- Package to parse and analyze trademark data from the United States Patent and Trademark Office☆12Updated 7 years ago
- A python package to enrich Twitter Data☆74Updated last year
- Fuzzy Topic Models☆26Updated 9 months ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Tools for interactive visual exploration of semantic embeddings.☆29Updated 4 months ago
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated last year
- A python tool for reading, parsing and finding patent using the United States Patent and Trademark (USPTO) Bulk Data Storage System.☆47Updated 2 years ago
- ☆84Updated 8 months ago
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 4 years ago
- A tool for Semantic Scaling of Political Text (branch of Topfish, a suite of tools for Political Text Analysis)☆27Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆57Updated last year
- ☆18Updated 4 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Updated 9 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Repository for the Tweet2Story framework for the extraction of narratives from tweets.☆13Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 weeks ago
- Transformer models for Augmented Inventing☆55Updated 3 years ago
- spaCy powered Label Studio ML backend☆30Updated 2 years ago
- Easy PDF to text to spaCy text extraction in Python.☆38Updated 3 months ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆74Updated 2 months ago
- The official implementation of the iConference 2022 paper "Identifying Machine-Paraphrased Plagiarism".☆17Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Updated 5 months ago
- ☆54Updated last year
- KeypartX is a graph-based approach to represent perception (text in general) by key parts of speech.Updated last year