Gawaboumga / CompanyMatchingLinks
Fuzzy matching for companies'names
☆9Updated 5 years ago
Alternatives and similar repositories for CompanyMatching
Users that are interested in CompanyMatching are comparing it to the libraries listed below
Sorting:
- Scalable String Similarity Joins in Python☆39Updated 10 months ago
- Hidden alignment conditional random field for classifying string pairs.☆24Updated 3 weeks ago
- A browser user interface for manual labeling of record pairs.☆47Updated last year
- A maximum-strength name parser for record linkage.☆37Updated last month
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 4 years ago
- ☆30Updated 2 years ago
- Python package aiding in entity disambiguation based on string and location matching☆18Updated last year
- Named entity recognition for the legal domain☆42Updated 4 years ago
- An index data structure for approximate string search.☆23Updated 6 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- demo using FuzzyWuzzy matching company names☆75Updated 3 years ago
- Using word embeddings (word2vec) for ontology learning☆19Updated 8 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 6 years ago
- Package that returns a company embedding given a company name☆46Updated 5 years ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated this week
- Language detection using Spacy and Fasttext☆55Updated last year
- Python package for deduplication/entity resolution using active learning☆80Updated 9 months ago
- This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs o…☆47Updated 7 years ago
- Fuzzy Categorical Distances☆14Updated 5 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- PAL: A tool for Pre-annotation and Active Learning☆18Updated 4 years ago
- A Flexible Deep Learning Approach to Fuzzy String Matching☆145Updated 7 months ago
- Code examples for Google Natural Language API.☆13Updated 5 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- ☆69Updated 3 years ago
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 8 months ago
- A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).☆15Updated 4 years ago
- Lossless in-memory compression of pandas DataFrames and Series powered by the visions type system. Up to 10x less RAM needed for the same…☆29Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago