AI-Growth-Lab / PatentSBERTaLinks
PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT
☆95Updated 11 months ago
Alternatives and similar repositories for PatentSBERTa
Users that are interested in PatentSBERTa are comparing it to the libraries listed below
Sorting:
- ☆68Updated 4 years ago
- ☆31Updated 2 years ago
- The Harvard USPTO Patent Dataset☆75Updated last year
- Domain Specific BERT Model for Text Mining in Sustainable Investing☆140Updated 3 months ago
- A python tool for reading, parsing and finding patent using the United States Patent and Trademark (USPTO) Bulk Data Storage System.☆55Updated 3 years ago
- Code and experiments for *BERTopic: Neural topic modeling with a class-based TF-IDF procedure*☆82Updated last year
- Transformer models for Augmented Inventing☆55Updated 3 years ago
- Code and Data for Text Classification of AI Related Patents Research Paper☆42Updated 2 years ago
- https://github.com/jcgcarranza/respol_patents_code☆38Updated 4 years ago
- ☆41Updated 3 years ago
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance.☆137Updated last year
- US utility patent similarity data creation and analysis tools☆26Updated 4 years ago
- A collection of topic diversity measures for topic modeling☆47Updated 4 years ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to…☆37Updated last year
- Text analysis with networks.☆288Updated 2 weeks ago
- CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.☆154Updated 8 months ago
- HDBSCAN Tuning for BERTopic Models☆49Updated 2 years ago
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)☆350Updated 6 months ago
- TweetNLP for all the NLP enthusiasts working on Twitter! The Python library tweetnlp provides a collection of useful tools to analyze/und…☆359Updated 6 months ago
- Target-dependent sentiment classification in news articles reporting on political events. Includes a high-quality data set of over 11k se…☆155Updated 2 months ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆144Updated last year
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆21Updated 6 years ago
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM☆96Updated 2 years ago
- Patent analysis using the Google Patents Public Datasets on BigQuery☆618Updated last year
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆264Updated 11 months ago
- Code for measuring novelty in science using publication text☆32Updated 7 months ago
- BERT, LDA, and TFIDF based keyword extraction in Python☆74Updated last year
- Technology Semantic Network (TechNet)☆34Updated 2 years ago
- Using the Gmail API to topic model my recommended Medium reads☆24Updated 4 years ago
- Mining Legal Arguments in Court Decisions - Data and software☆70Updated 2 years ago