psolin/cleanco

Company Name Processor written in Python

☆358

Alternatives and similar repositories for cleanco

Users that are interested in cleanco are comparing it to the libraries listed below.

DeNederlandscheBank / name_matching
Name matching is a Python package for the matching of company names. This package has been developed to match the names of companies from…
☆167 · Jul 2, 2026 · Updated last week
sachinchaturvedi93 / Company-Name-Standardization
Using Natural Language Processing to standardize Company Names
☆11 · Aug 4, 2021 · Updated 4 years ago
danielm-github / patentsmatch_bingsearchapproach
Match Patent Assignees with Compustat and SDC via Bing Search
☆55 · Sep 29, 2020 · Updated 5 years ago
rahulissar / ai-supply-chain
Repository for common AI use cases in supply chain, procurement
☆24 · Oct 8, 2021 · Updated 4 years ago
openownership / register
A demonstration transnational register of beneficial ownership data from the UK, Denmark, Slovakia and Armenia
☆19 · Oct 30, 2024 · Updated last year
ssrn3632395 / The-Role-of-Corporate-Culture-in-Bad-Times
Python and R code for text analysis and topic modeling
☆11 · Jun 14, 2021 · Updated 5 years ago
Wenzhi-Ding / FamaFrenchIndustry
Link tables between SIC and Fama French Industry Classification.
☆22 · Oct 14, 2024 · Updated last year
michaelewens / Non-compete-Law-Changes
Code to incorporate non-compete law changes using Stata, R and Python (Ewens and Marx (2017))
☆12 · Jun 27, 2023 · Updated 3 years ago
blucap / EEA_Python_Primer
EAA - Python Primer for Accounting Research
☆13 · Feb 15, 2022 · Updated 4 years ago
alesee / Bussiness2Vector
Jupyter Notebooks for Bussiness2Vector
☆13 · Jun 28, 2018 · Updated 8 years ago
ing-bank / EntityMatchingModel
Entity Matching Model solves the problem of matching company names between two possibly very large datasets.
☆97 · May 18, 2026 · Updated last month
J535D165 / recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
☆1,054 · Feb 21, 2024 · Updated 2 years ago
michaelewens / vc_backed_boards
A database on VC-backed startups from Ewens and Malenko (2025)
☆14 · Feb 15, 2025 · Updated last year
mschwedeler / firmlevelrisk
Example code to create firm level risk in Hassan et al. (2020)
☆59 · Aug 23, 2022 · Updated 3 years ago
Bergvca / string_grouper
Super Fast String Matching in Python
☆372 · Jul 3, 2026 · Updated last week
donbowen / Patent-Text-Variables
Downloads all Google patent pages, assembles bag-o-words, and constructs RETech and Patent Breadth
☆10 · Jun 9, 2025 · Updated last year
KPSS2017 / Technological-Innovation-Resource-Allocation-and-Growth-Replication-Kit
This repository provides the replication code and data for Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017.
☆42 · Aug 16, 2021 · Updated 4 years ago
iangow / edgar
Code to manage data related to SEC EDGAR
☆33 · Aug 21, 2025 · Updated 10 months ago
ing-bank / sparse_dot_topn
Python package to accelerate the sparse matrix multiplication and top-n similarity selection
☆423 · Jun 29, 2026 · Updated last week
dedupeio / dedupe
A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
☆4,488 · Jul 29, 2025 · Updated 11 months ago
Innovation-Information-Initiative / Open-Innovation-Dataset-Index
a versioned .csv file that auto-updates from the i3 index google sheet
☆24 · Mar 26, 2026 · Updated 3 months ago
jeffgortmaker / pyhdfe
High dimensional fixed effect absorption with Python 3
☆60 · Feb 7, 2024 · Updated 2 years ago
dedupeio / dedupe-geocoder
Demonstration of how dedupe might be used as geocoder
☆17 · Jun 21, 2022 · Updated 4 years ago
KPSS2017 / Technological-Innovation-Resource-Allocation-and-Growth-Extended-Data
This repository provides updates and extended data following Kogan, L., Papanikolaou, D., Seru, A. and Stoffman, N., QJE 2017
☆226 · Dec 8, 2025 · Updated 7 months ago
JarrodAJ / sec_employee_information_extraction
NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …
☆15 · Aug 20, 2018 · Updated 7 years ago
jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
☆2,221 · Jun 23, 2026 · Updated 2 weeks ago
IBM / MAX-Named-Entity-Tagger
Locate and tag named entities in text
☆24 · Sep 17, 2025 · Updated 9 months ago
datamade / probablepeople
a python library for parsing unstructured western names into name components.
☆621 · May 15, 2025 · Updated last year
MagnusDahlquist / PhD403
PhD 403: Empirical Asset Pricing
☆26 · Dec 3, 2018 · Updated 7 years ago
matthieugomez / colorscheme
Stata command to generate color schemes
☆18 · Mar 30, 2026 · Updated 3 months ago
openvenues / pypostal
Python bindings to libpostal for fast international address parsing/normalization
☆880 · Nov 1, 2025 · Updated 8 months ago
RUrlus / ModelMetricUncertainty
Python package for Model Metric Uncertainty estimation
☆17 · Jun 29, 2026 · Updated last week
benjann / estout
Stata module to make regression tables
☆83 · Apr 13, 2026 · Updated 2 months ago
drelhaj / CFIE-FRSE
CFIE Final Report Structure Extractor (FRSE) is a free tool to detect structure and extract contents from UK Annual Reports
☆34 · Nov 16, 2020 · Updated 5 years ago
shade-econ / nber-workshop-2025
Code for the Spring 2025 NBER heterogeneous-agent macro workshop
☆72 · Jul 7, 2025 · Updated last year
Gawaboumga / iso-20275-python
ISO 20275
☆10 · Jun 12, 2026 · Updated 3 weeks ago
michaelewens / SDC-to-Compustat-Mapping
A mapping between SDC's M&A database and the gvkeys in Compustat
☆95 · Jul 17, 2024 · Updated last year
eloualiche / RiskPremium
Measuring the Market Risk Premium
☆18 · Mar 30, 2026 · Updated 3 months ago
dmsul / econtools
Econometrics and data manipulation functions.
☆114 · Aug 13, 2021 · Updated 4 years ago