suzgunmirac / hupd
The Harvard USPTO Patent Dataset
☆75 Updated last year
Alternatives and similar repositories for hupd
Users who are interested in hupd are comparing it to the libraries listed below.
- A Python tool for reading, parsing, and finding patents using the United States Patent and Trademark Office (USPTO) Bulk Data Storage System. ☆55 Updated 3 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT ☆95 Updated 11 months ago
- ☆68 Updated 4 years ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable. ☆154 Updated 8 months ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite ☆95 Updated last month
- ☆31 Updated 2 years ago
- Transformer models for Augmented Inventing ☆55 Updated 3 years ago
- Twitter dataset for 2022 Russian and Ukrainian crisis ☆48 Updated 2 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to… ☆37 Updated last year
- TensorFlow 2 implementation of Causal-BERT ☆71 Updated last year
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science … ☆28 Updated 2 years ago
- Dataset accompanying the SPECTER model ☆139 Updated 2 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban… ☆105 Updated last year
- ☆34 Updated last year
- Collection of public APIs for embedding scientific papers ☆58 Updated 4 years ago
- Compute novelty indicators ☆33 Updated last year
- Measuring the Evolution of a Scientific Field through Citation Frames ☆61 Updated 7 years ago
- US utility patent similarity data creation and analysis tools ☆26 Updated 4 years ago
- Technology Semantic Network (TechNet) ☆34 Updated 2 years ago
- The Semantic Scholar Search Reranker ☆108 Updated 4 years ago
- HDBSCAN Tuning for BERTopic Models ☆49 Updated 2 years ago
- PyTorch implementation of "Adapting Text Embeddings for Causal Inference" ☆92 Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆71 Updated 2 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi… ☆33 Updated last year
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance. ☆137 Updated last year
- Code, data, and models for "POLITICS: Pretraining with Same-story Article Comparison for Ideology Prediction and Stance Detection" ☆36 Updated last year
- Code for Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis (NAACL 2021 oral paper) ☆66 Updated 3 years ago
- ☆41 Updated last month
- ☆79 Updated last year
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations ☆91 Updated 3 years ago