trevorprater / serfLinks
Stanford Entity-Resolution Framework
☆24Updated 7 years ago
Alternatives and similar repositories for serf
Users that are interested in serf are comparing it to the libraries listed below
Sorting:
- ☆20Updated 8 years ago
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- Raw Wikipedia counts for entity linking☆19Updated 8 years ago
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- Vizlinc☆15Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- R tools for GDELT and the Global Knowledge Graph☆14Updated 11 years ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d…☆27Updated 8 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- ☆20Updated 8 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- A book on the applications of topic models.☆14Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- Pattern-of-Behavior Search Tool☆11Updated 3 years ago
- Using Word2Vec on lists and sets☆34Updated 3 weeks ago
- The Python-language successor to the TABARI event-data coding software.☆45Updated 7 years ago
- [hibernating] Dynamic topic models☆39Updated 10 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- deep inverse regression☆31Updated 9 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Binding the GDELT universe in a Spark environment☆25Updated 2 years ago
- Latent dirichlet allocation (LDA) for datamicroscopes☆41Updated 9 years ago
- open source version of the Bonsai library☆26Updated 9 years ago
- Extract statistics from Wikipedia Dump files.☆26Updated 3 years ago
- Topic Modeling the Sarah Palin emails.☆34Updated 13 years ago
- A Generalized Data Cleaning System☆50Updated 9 years ago
- ☆92Updated 9 years ago