JYProjs / patentpyLinks
Python package to access USPTO bulk data in rectangular format
☆16Updated 3 years ago
Alternatives and similar repositories for patentpy
Users that are interested in patentpy are comparing it to the libraries listed below
Sorting:
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆133Updated 2 months ago
- A python tool for reading, parsing and finding patent using the United States Patent and Trademark (USPTO) Bulk Data Storage System.☆57Updated 3 years ago
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆120Updated 2 months ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- A series of Jupyter Notebooks that demonstrate how to scrape data from the S&P Capital IQ Website, provided that you already have access …☆18Updated 6 years ago
- FiNER: Financial Numeric Entity Recognition for XBRL Tagging☆70Updated 3 years ago
- Nesta's Skills Extractor Library☆150Updated 6 months ago
- Library for creating causal chains using language models.☆81Updated 2 years ago
- The Harvard USPTO Patent Dataset☆79Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆111Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated last week
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 3 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆103Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document …☆186Updated last year
- Robust and fast topic models with sentence-transformers.☆88Updated 3 weeks ago
- ☆31Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Noise-robust de-duplication at scale☆19Updated 2 years ago
- Course repository for the session "Hands-on Transformers: Fine-Tune your own BERT and GPT" of the Data Science Summer School 2023☆90Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 11 months ago
- Google Trends, made easy.☆117Updated last year
- Learning from Neighbors: Unsupervised Text Classification☆17Updated 3 years ago
- ☆54Updated 2 weeks ago
- A Corpus of 475,000 Industrial Occupations☆70Updated 5 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆155Updated 11 months ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆34Updated last year
- A dataset for pretraining language models targeted for legal tasks.☆140Updated 3 years ago