perchrh / sanction_list_search
Name search for people and entities on the EU, OFAC and UN sanction lists
☆24 · Updated 4 years ago
Alternatives and similar repositories for sanction_list_search:
Users who are interested in sanction_list_search are comparing it to the libraries listed below.
- Package that returns a company embedding given a company name ☆42 · Updated 4 years ago
- Fuzzy matching for companies' names ☆9 · Updated 5 years ago
- Spell correct entire sentences using NLTK FreqDist and SymSpell ☆19 · Updated 7 years ago
- Using ML to extract campaign finance data from messy forms for journalism ☆76 · Updated 2 years ago
- Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation. ☆147 · Updated last week
- RESTful autocomplete service with Neo4j graph backend. Returns top suggestions. ☆39 · Updated 2 weeks ago
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex. ☆60 · Updated this week
- Build a deep learning model for predicting the named entities from text. ☆56 · Updated 6 years ago
- Loading OpenSanctions into Neo4j and Linkurious ☆27 · Updated last month
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆56 · Updated 11 months ago
- Tutorial for Topic Modelling using PySpark and Spark NLP ☆16 · Updated 4 years ago
- A DeepWalk implementation for ontologies using NetworkX and Gensim ☆19 · Updated 7 years ago
- Extracting Semi-Structured Data from PDFs on a large scale ☆51 · Updated 2 years ago
- A fully customisable language detection pipeline for spaCy ☆93 · Updated 5 years ago
- Utilize the deep learning library Keras to classify transactions as fraudulent (1) or non-fraudulent (0). ☆50 · Updated 6 years ago
- Trying to generate name synonyms from Wikidata ☆33 · Updated 4 years ago
- Extracting addresses from text ☆41 · Updated 6 years ago
- Code examples for Google Natural Language API. ☆13 · Updated 5 years ago
- This repository contains the DFKI Product Corpus, a dataset of 174 documents annotated for product and company named entities, and the re… ☆12 · Updated 4 months ago
- Prodigy thing(z) ☆13 · Updated 6 years ago
- Python script for matching a list of messy addresses against a gazetteer using dedupe. ☆62 · Updated 4 years ago
- 🚀 GUI for training spaCy models ☆54 · Updated 3 years ago
- Python/Django based webapps and web user interfaces for search, structure (meta data management like thesaurus, ontologies, annotations a… ☆95 · Updated 2 years ago
- Topic modelling on financial news with Natural Language Processing ☆58 · Updated 7 years ago
- Algorithms for "schema matching" ☆25 · Updated 8 years ago