suzgunmirac / hupd
The Harvard USPTO Patent Dataset
☆68Updated last year
Alternatives and similar repositories for hupd
Users that are interested in hupd are comparing it to the libraries listed below
Sorting:
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆83Updated 6 months ago
- ☆66Updated 3 years ago
- A python tool for reading, parsing and finding patent using the United States Patent and Trademark (USPTO) Bulk Data Storage System.☆52Updated 2 years ago
- US utility patent similarity data creation and analysis tools☆26Updated 4 years ago
- Transformer models for Augmented Inventing☆55Updated 3 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆32Updated last year
- Code for measuring novelty in science using publication text☆26Updated 2 months ago
- ☆18Updated 5 years ago
- Compute novelty indicators☆33Updated 11 months ago
- https://github.com/jcgcarranza/respol_patents_code☆34Updated 4 years ago
- Technology Semantic Network (TechNet)☆34Updated 2 years ago
- ☆38Updated 2 years ago
- A large-scale open data lake for the science of science research.☆75Updated 2 months ago
- ☆28Updated 2 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance.☆136Updated last year
- MultiCite code and data. Models are available on Huggingface.☆31Updated 3 years ago
- ☆70Updated 2 months ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated last year
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆67Updated 2 years ago
- Measuring the Evolution of a Scientific Field through Citation Frames☆56Updated 6 years ago
- Resources and tools for the Tutorial - "Hate speech detection, mitigation and beyond" presented at ICWSM 2021☆37Updated 3 years ago
- Curated list of resources for processing patent data☆75Updated 11 months ago
- ☆41Updated this week
- HDBSCAN Tuning for BERTopic Models☆45Updated last year
- Code and Data for Text Classification of AI Related Patents Research Paper☆35Updated last year
- EDGAR10-Q Dataset and implementation of the paper Context NER☆17Updated last year
- A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning☆118Updated last month
- ☆91Updated 2 years ago