verginer / disamby
Python package aiding in entity disambiguation based on string and location matching
☆18 · Updated last year
Alternatives and similar repositories for disamby:
Users who are interested in disamby are comparing it to the libraries listed below.
- Event extraction pipeline. ☆34 · Updated 7 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 · Updated 7 years ago
- Regex-like tree pattern matching, but on a sentence's tree instead of strings ☆42 · Updated 6 years ago
- NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering) ☆115 · Updated 8 months ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification… ☆17 · Updated 5 years ago
- Turning news into events since 2014. ☆50 · Updated 7 years ago
- Language-agnostic political event coding using universal dependencies ☆18 · Updated 5 years ago
- Ensemble topic modeling with matrix factorization ☆24 · Updated 6 years ago
- Python library providing sentiment lexicons. ☆26 · Updated 8 years ago
- Knowledge extraction from web data ☆92 · Updated 6 years ago
- Data Server for Topic Models ☆121 · Updated last year
- TAXI: a Taxonomy Induction Method based on Lexico-Syntactic Patterns, Substrings and Focused Crawling ☆28 · Updated last year
- Scrapes the web. Gets the news. ☆13 · Updated 8 years ago
- Another next-generation event coding platform. ☆72 · Updated 5 years ago
- Tutorial code and data for the entity resolution workshops. ☆44 · Updated 9 years ago
- Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu ☆16 · Updated 10 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking ☆86 · Updated 2 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 · Updated 2 years ago
- ☆30 · Updated 2 years ago
- Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019 ☆29 · Updated 5 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with… ☆59 · Updated 6 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆40 · Updated 5 years ago
- Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org ☆11 · Updated 10 years ago
- Active Learning for text classification using scikit-learn ☆23 · Updated 5 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆42 · Updated 4 years ago
- [development moved to termite-data-server] ☆61 · Updated 10 years ago
- topic model visualization ☆52 · Updated 9 years ago
- Text Thresher crowd sourced text annotator ☆17 · Updated 7 years ago
- Running Prodigy for a team of annotators ☆53 · Updated 4 years ago
- Quickly extract multi-word phrases from a corpus ☆190 · Updated 4 years ago