Company Name Processor written in Python
☆351Jan 16, 2026Updated last month
Alternatives and similar repositories for cleanco
Users that are interested in cleanco are comparing it to the libraries listed below
Sorting:
- Using Natural Language Processing to standardize Company Names☆11Aug 4, 2021Updated 4 years ago
- Match Patent Assignees with Compustat and SDC via Bing Search☆55Sep 29, 2020Updated 5 years ago
- Entity Matching Model solves the problem of matching company names between two possibly very large datasets.☆90Updated this week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- Python and R code for text analysis and topic modeling☆12Jun 14, 2021Updated 4 years ago
- EAA - Python Primer for Accounting Research☆13Feb 15, 2022Updated 4 years ago
- Code to incorporate non-compete law changes using Stata, R and Python (Ewens and Marx (2017))☆12Jun 27, 2023Updated 2 years ago
- A database on VC-backed startups from Ewens and Malenko (2025)☆13Feb 15, 2025Updated last year
- Example code to create firm level risk in Hassan et al. (2020)☆57Aug 23, 2022Updated 3 years ago
- Super Fast String Matching in Python☆371Mar 14, 2025Updated 11 months ago
- A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia☆19Oct 30, 2024Updated last year
- Link tables between SIC and Fama French Industry Classification.☆21Oct 14, 2024Updated last year
- Python package to accelerate the sparse matrix multiplication and top-n similarity selection☆420Jan 12, 2026Updated last month
- High dimensional fixed effect absorption with Python 3☆57Feb 7, 2024Updated 2 years ago
- Code for the Spring 2025 NBER heterogeneous-agent macro workshop☆59Jul 7, 2025Updated 7 months ago
- PhD 403: Empirical Asset Pricing☆28Dec 3, 2018Updated 7 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,438Jul 29, 2025Updated 7 months ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆16Aug 20, 2018Updated 7 years ago
- A simple command line interface to the datamade/dedupe library.☆43Dec 26, 2022Updated 3 years ago
- Sidewall is a Python library for interacting with the Dimensions search API.☆18Sep 11, 2024Updated last year
- Measuring the Market Risk Premium☆18Jun 6, 2022Updated 3 years ago
- Stata command to generate color schemes☆18Sep 3, 2021Updated 4 years ago
- Python utilities for working with Kilts-Nielsen files☆46Updated this week
- This repository provides the replication code and data for Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017.☆41Aug 16, 2021Updated 4 years ago
- Code to manage data related to SEC EDGAR☆33Aug 21, 2025Updated 6 months ago
- A python script to create a mapping table between I/B/E/S and Compustat☆18Oct 24, 2019Updated 6 years ago
- This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017☆203Dec 8, 2025Updated 2 months ago
- a versioned .csv file that auto-updates from the i3 index google sheet☆24Jan 19, 2026Updated last month
- Resources for a PhD class module focused on anomalies.☆19Jun 7, 2024Updated last year
- A mapping between SDCs M&A database and the gvkey's in Compustat☆90Jul 17, 2024Updated last year
- Explanation of IPO data extraction from SDC Platinum, data cleaning and matching with CRSP☆21Aug 15, 2017Updated 8 years ago
- Stata module to make regression tables☆81Apr 9, 2023Updated 2 years ago
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,193Dec 15, 2025Updated 2 months ago
- Python bindings to libpostal for fast international address parsing/normalization☆864Nov 1, 2025Updated 4 months ago
- Stata Workflows for LaTeX Output☆122Feb 28, 2021Updated 5 years ago
- Econometrics and data manipulation functions.☆114Aug 13, 2021Updated 4 years ago
- Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends☆1,980Updated this week
- List of entity resolution software and resources.☆109Feb 22, 2025Updated last year
- A simple Python module for parsing human names into their individual components☆702May 28, 2024Updated last year