qcri / NADEEFLinks
A Generalized Data Cleaning System
☆50Updated 9 years ago
Alternatives and similar repositories for NADEEF
Users that are interested in NADEEF are comparing it to the libraries listed below
Sorting:
- A Machine Learning System for Data Enrichment.☆75Updated 7 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 13 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆110Updated 9 years ago
- ☆40Updated 9 years ago
- ☆79Updated 2 years ago
- Hybrid Question Answering (HAWK) -- is going to drive forth the OKBQA vision of hybrid question answering system using Linked Data and fu…☆16Updated 3 years ago
- Topological Anomaly Detection (TAD) per Gartley and Basener 2009☆68Updated 5 years ago
- A project for clustering text streams using locality-sensitive hashing (LSH) in Python☆26Updated 14 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- Tools for iterative knowledge base development with DeepDive☆120Updated 6 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 8 years ago
- lightweight python wrapper for vowpal wabbit☆168Updated 5 years ago
- ☆92Updated 9 years ago
- ☆192Updated last year
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆98Updated 10 years ago
- Scalable Graph Mining☆63Updated 2 years ago
- Semantic Preserving Embeddings for Generalized Graphs☆31Updated 6 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆207Updated 2 years ago
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- A library that allows serialization of SciKit-Learn estimators into PMML☆71Updated 6 years ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆148Updated last year
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆111Updated 3 years ago
- Tools for building a Lucene index for Semantic Vectors☆21Updated 10 years ago
- Tools and Libraries for Lexicon-Based Sentiment Analysis☆24Updated 9 years ago
- Extract opionion phrases from user reviews☆63Updated 11 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 6 years ago
- A web based data mining workflow platform with real-time analysis capabilities☆49Updated 2 years ago
- Machine Learning Tool Kit☆138Updated 5 years ago