markhuberty / psClean
Python library for cleaning, disambiguating, and formatting inventors in the PATSTAT patent data file
☆22Updated 11 years ago
Alternatives and similar repositories for psClean
Users who are interested in psClean are comparing it to the libraries listed below
- Topic models (just LDA for now) on the Hacker News corpus☆22Updated 9 years ago
- ☆18Updated 5 years ago
- ☆11Updated 8 years ago
- Patent analysis tool in R☆15Updated 7 years ago
- The USPTO Patent Exploring Tool (UPET) provides Python code for downloading, parsing, and loading USPTO patent bulk data into a local MyS…☆34Updated 12 years ago
- ☆70Updated 11 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Fast, flexible name matching for large datasets☆72Updated last year
- Create and customize your own Periodic Table. With the help of Streamlit and Bokeh.☆18Updated 2 years ago
- Tools to work with patent files released by Google.☆19Updated 12 years ago
- An alternative approach for probabilistic topic modeling based on agglomerative clustering of topics (not documents)☆12Updated 4 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- Scrapes the web. Gets the news.☆13Updated 8 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 7 years ago
- Text Mining Patents for Big Data Course Project☆28Updated 9 years ago
- Python module for bibliographic network analysis.☆84Updated 4 years ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- ☆16Updated 6 years ago
- Generates the most important key-phrase/key-words from a document based on a corpus☆10Updated 11 months ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated 2 years ago
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 5 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated last year
- ☆19Updated 6 years ago
- Full paper available on Researchgate☆17Updated 6 years ago
- A Chinese name matcher written in Python. Described in: Nanyun Peng, Mo Yu, Mark Dredze. An Empirical Study of Chinese Name Matching and A…☆36Updated 9 years ago
- Multilayer Feed-Forward Neural Network predictive model implementations with TensorFlow and scikit-learn☆45Updated 2 years ago
- ☆41Updated last week
- Experiment, Storage and Visualization Framework for Machine Learning research.☆31Updated 3 years ago
- Probabilistic/machine-learning algorithms for medical record linkage [Critical Juncture]☆14Updated 7 years ago