markhuberty / psClean
Python library for cleaning, disambiguating, and formatting inventors in the PATSTAT patent data file
☆22Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for psClean
- patent analysis tool in R☆14Updated 7 years ago
- ☆11Updated 8 years ago
- Text Mining Patents for Big Data Course Project☆26Updated 8 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Turning news into events since 2014.☆50Updated 7 years ago
- ☆17Updated 4 years ago
- Calculate weighted mean, median, and weighted median.☆19Updated 4 years ago
- Topic modelling with SpaCy, Gensim and Textacy☆19Updated 6 years ago
- Community detection in patent co-citation network☆12Updated 5 years ago
- The USPTO Patent Exploring Tool (UPET) provides Python code for downloading, parsing, and loading USPTO patent bulk data into a local MyS…☆34Updated 11 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆193Updated last year
- Tools to work with patent files released by Google.☆19Updated 11 years ago
- This is the text partitioner project for Python.☆20Updated 5 years ago
- Statistical inference on machine learning or general non-parametric models☆43Updated 6 months ago
- Scrapes the web. Gets the news.☆13Updated 8 years ago
- Patent Classification with Machine Learning☆14Updated 5 years ago
- ☆36Updated 3 weeks ago
- Currency Portfolio Optimization - IPython notebook and data☆25Updated 8 years ago
- ☆30Updated 4 months ago
- A list of GDELT themes that taken together broadly represent "issues" and media source lists, a way to split GDELT sources into more conc…☆20Updated 5 years ago
- https://github.com/jcgcarranza/respol_patents_code☆29Updated 4 years ago
- Clinical trial designs and methods in Python☆19Updated 8 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆41Updated 2 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu☆16Updated 10 years ago
- This repository is not maintained anymore. ConfusionMatrix is now part of pandas-ml☆19Updated 8 years ago
- Material from presentations☆13Updated 3 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 9 years ago