suzgunmirac / hupd
The Harvard USPTO Patent Dataset
☆66Updated last year
Alternatives and similar repositories for hupd:
Users that are interested in hupd are comparing it to the libraries listed below
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆82Updated 5 months ago
- ☆64Updated 3 years ago
- US utility patent similarity data creation and analysis tools☆26Updated 4 years ago
- A python tool for reading, parsing and finding patent using the United States Patent and Trademark (USPTO) Bulk Data Storage System.☆51Updated 2 years ago
- Compute novelty indicators☆32Updated 10 months ago
- Transformer models for Augmented Inventing☆55Updated 3 years ago
- ☆38Updated 2 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated last year
- ☆18Updated 5 years ago
- Technology Semantic Network (TechNet)☆34Updated 2 years ago
- Project repository of the paper "Less Annotating, More Classifying – Addressing the Data Scarcity Issue of Supervised Machine Learning wi…☆31Updated last year
- Code for measuring novelty in science using publication text☆26Updated last month
- Curated list of resources for processing patent data☆74Updated 10 months ago
- Code for the JCDL 2023 paper CitePrompt: Using Prompts to Identify Citation Intent in Scientific Papers☆9Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆67Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆31Updated 2 years ago
- Evaluation and benchmarking of PatentsView disambiguation algorithms☆13Updated last year
- ☆41Updated this week
- The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science …☆27Updated 2 years ago
- Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite☆93Updated last year
- ☆26Updated 3 years ago
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- ☆87Updated 11 months ago
- Search for and retrieve US Patent and Trademark Office Patent Data☆79Updated 4 years ago
- Measuring the Evolution of a Scientific Field through Citation Frames☆56Updated 6 years ago
- ☆27Updated 3 years ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆23Updated 2 years ago
- ✨ Awesome - A curated list of amazing Topic Models (implementations, libraries, and resources)☆93Updated 2 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆104Updated last year
- Introducing gpt_annotate: an easy-to-use python package designed to streamline automated text annotation using LLMs for different tasks a…☆28Updated 7 months ago