trevorprater / serfLinks
Stanford Entity-Resolution Framework
☆24Updated 6 years ago
Alternatives and similar repositories for serf
Users that are interested in serf are comparing it to the libraries listed below
Sorting:
- SmallK: very fast data clustering tools☆14Updated 6 years ago
- Collection of some algorithms for entity resolution☆28Updated 9 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- ☆20Updated 8 years ago
- Algorithms for "schema matching"☆26Updated 8 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- variations of the record linkage model of Steorts et al. AISTATS 2014's "SMERED: A Bayesian Approach to Graphical Record Linkage and De-d…☆27Updated 8 years ago
- Semanticizest: dump parser and client☆20Updated 9 years ago
- ☆40Updated 8 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆65Updated 6 years ago
- Data Server for Topic Models☆121Updated 2 years ago
- Raw Wikipedia counts for entity linking☆19Updated 8 years ago
- Pattern-of-Behavior Search Tool☆11Updated 2 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 7 years ago
- Code for Sentiment Analysis Symposium tutorial demos.☆15Updated 8 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 9 years ago
- A Generalized Data Cleaning System☆50Updated 9 years ago
- A book on the applications of topic models.☆14Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- deep inverse regression☆31Updated 9 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- A Machine Learning System for Data Enrichment.☆75Updated 6 years ago
- Examples of using Neo4j with R.☆23Updated 9 years ago
- PMML evaluator library for the PostgreSQL database (http://www.postgresql.org/)☆11Updated 10 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- The Python-language successor to the TABARI event-data coding software.☆45Updated 7 years ago
- [hibernating] Dynamic topic models☆39Updated 9 years ago
- Near-Duplicate Detection in Python.☆25Updated 3 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago