suzgunmirac / hupd
The Harvard USPTO Patent Dataset
☆77 Updated last year
Alternatives and similar repositories for hupd
Users who are interested in hupd are comparing it to the libraries listed below.
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆97 Updated 11 months ago
- ☆69 Updated 4 years ago
- A Python tool for reading, parsing, and finding patents using the United States Patent and Trademark Office (USPTO) Bulk Data Storage System. ☆56 Updated 3 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable. ☆154 Updated 8 months ago
- Transformer models for Augmented Inventing ☆55 Updated 3 years ago
- Measuring the Evolution of a Scientific Field through Citation Frames ☆61 Updated 7 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi… ☆34 Updated last year
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite ☆97 Updated last month
- ☆33 Updated 2 years ago
- Compute novelty indicators ☆33 Updated last year
- Dataset accompanying the SPECTER model ☆140 Updated 2 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆105 Updated last year
- Curated list of resources for processing patent data ☆88 Updated last year
- ☆18 Updated 5 years ago
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance. ☆137 Updated last year
- Collection of public APIs for embedding scientific papers ☆59 Updated 4 years ago
- TopicGPT integrates the benefits of LLMs into topic modelling ☆28 Updated last year
- US utility patent similarity data creation and analysis tools ☆27 Updated 5 years ago
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24) ☆355 Updated 7 months ago
- Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK) ☆80 Updated 3 years ago
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science … ☆28 Updated 2 years ago
- Twitter dataset for 2022 Russian and Ukrainian crisis ☆48 Updated 2 years ago
- ✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources) ☆98 Updated 3 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to… ☆37 Updated last year
- Code and data for "Heterogeneous Supervised Topic Models" ☆11 Updated 3 years ago
- Introducing gpt_annotate: an easy-to-use Python package designed to streamline automated text annotation using LLMs for different tasks a… ☆29 Updated last year
- Evaluation and benchmarking of PatentsView disambiguation algorithms ☆13 Updated last year
- PyTorch implementation of "Adapting Text Embeddings for Causal Inference" ☆92 Updated 3 years ago
- ☆34 Updated last year
- Technology Semantic Network (TechNet) ☆34 Updated 2 years ago