Sivalavida / Text-based-Industry-Classification
Using NLP techniques to classify companies according to their descriptions
☆13Updated 4 years ago
Alternatives and similar repositories for Text-based-Industry-Classification
Users interested in Text-based-Industry-Classification are comparing it to the repositories listed below.
- A browser user interface for manual labeling of record pairs.☆47Updated last year
- List of entity resolution software and resources.☆71Updated 3 months ago
- Python package for text mining of time-series data☆73Updated last month
- Audit CDM☆14Updated 3 months ago
- Powerful topic model visualization in Python☆124Updated 2 months ago
- How can we improve name matching in screening tools?☆12Updated 4 months ago
- ☆39Updated 3 months ago
- Loading OpenSanctions into Neo4J and Linkurious☆28Updated 5 months ago
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex.☆62Updated last week
- Examples For AI Agent☆13Updated 4 months ago
- The dataset used to evaluate JobBERT on the task of job title normalization.☆27Updated 2 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆13Updated 2 months ago
- An open interface to GDELT APIs☆49Updated last year
- DuckDB Community Extension to prompt LLMs from SQL☆48Updated 5 months ago
- Using Natural Language Processing to standardize Company Names☆12Updated 3 years ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆210Updated this week
- Bulk loading of large data sets into Neo4j☆22Updated last month
- Interactive notebooks containing demonstration code of the splink library☆38Updated last year
- new skills taxonomy using TextKernel data☆33Updated 2 years ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- A Python package to create a database on the platform using our moj data warehousing framework☆21Updated 9 months ago
- A package to build an end-to-end pipeline for detecting personally identifiable information from text.☆45Updated 6 years ago
- Analyzing crime reported in the U.S. using data derived from Common Crawl, the New York Times API, and Twitter.☆16Updated 5 years ago
- Record matching and entity resolution at scale in Spark☆34Updated last year
- Codeless Deep Learning with KNIME☆14Updated 2 years ago
- LLM extension for OpenRefine☆22Updated 2 weeks ago
- Entity Resolution☆13Updated last year
- An automation tool to refactor Jupyter Notebooks to Python modules, with code dependency analysis.☆12Updated 3 months ago
- A Python client for the GDELT 2.0 Doc API☆135Updated last month
- NLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to …☆35Updated 2 years ago