sachinchaturvedi93 / Company-Name-Standardization
Using Natural Language Processing to standardize Company Names
☆12Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Company-Name-Standardization
- A client library for accessing the USPTO Open Data APIs, written in Python.☆91Updated 2 years ago
- Search for and retrieve US Patent and Trademark Office Patent Data☆76Updated 4 years ago
- Handy Jupyter Notebooks that I use in for Topic Modeling. Including text mining from PDF files, text preprocessing, Latent Dirichlet Allo…☆41Updated 5 years ago
- Gem to allow easy access to data from the WIPO PATENTSCOPE Web Service☆14Updated 3 years ago
- AI + Legal APIs: A Tool-Based Retrieval Augmented Generation Workbench for Legal AI UX Research.☆46Updated 3 weeks ago
- demo using FuzzyWuzzy matching company names☆74Updated 2 years ago
- ☆135Updated 3 weeks ago
- ☆53Updated 10 months ago
- Package that returns a company embedding given a company name☆42Updated 4 years ago
- ☆36Updated 3 weeks ago
- Parse and cluster USPTO patent data. Includes applications, grants, assignments, and maintenance.☆131Updated last year
- The USPTO Patent Exploring Tool (UPET) provides Python code for downloading, parsing, and loading USPTO patent bulk data into a local MyS…☆34Updated 11 years ago
- A collection of ORM-style clients to public patent data☆92Updated last month
- NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to …☆36Updated 2 years ago
- Automatically download all PDF files of searching results & their patent families found on Google Patents.☆58Updated last year
- PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multip…☆100Updated last year
- Python client for EPO OPS, the European Patent Office's Open Patent Services API.☆144Updated 2 weeks ago
- Fuzzy matches and merging of datasets in pandas using csvmatch☆74Updated 4 years ago
- ☆22Updated 3 years ago
- PatentSBERTa: A Deep NLP based Hybrid Model for Patent Distance and Classification using Augmented SBERT☆70Updated 2 weeks ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆59Updated this week
- new skills taxonomy using TextKernel data☆30Updated 2 years ago
- Named entity recognition for the legal domain☆40Updated 3 years ago
- 📚 Process PDFs, Word documents and more with spaCy☆75Updated this week
- ☆61Updated this week
- A Named Entity Recognition system that extracts soft skills from text☆27Updated 3 months ago
- Code for measuring novelty in science using publication text☆15Updated 3 weeks ago
- Deploying Pyvis Interactive Network Graphs in Streamlit☆55Updated 2 years ago
- ☆13Updated 4 years ago
- SKILLSPAN: Competences as Spans for Skill Extraction from Job Postings☆56Updated 9 months ago