VIDA-NYU / auctus
Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index
☆43 Updated last year
Alternatives and similar repositories for auctus:
Users who are interested in auctus compare it to the repositories listed below
- Project overview and links to various resources ☆19 Updated 3 years ago
- A Jupyter notebook extension to centralize and manage data ☆14 Updated 2 years ago
- ☆11 Updated last year
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method ☆13 Updated last year
- Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning ☆51 Updated 2 years ago
- A tool facilitating matching for any dataset discovery method. Also, an extensible experiment suite for state-of-the-art schema matching … ☆88 Updated 3 weeks ago
- Loading OpenSanctions into Neo4J and Linkurious ☆28 Updated 4 months ago
- Jenga is an experimentation library that allows data science practitioners and researchers to study the effect of common data corruptio… ☆40 Updated last year
- FlowSense: A Natural Language Interface for Visual Data Exploration within a Dataflow System ☆46 Updated 2 years ago
- JedAI-WebApp is a GUI that facilitates the execution of JedAI. JedAI is an open source, high scalability toolkit that offers out-of-the-b… ☆23 Updated 2 years ago
- Distributed Bayesian Entity Resolution in Apache Spark ☆57 Updated 3 years ago
- ☆15 Updated 2 years ago
- PyPi module for Graphlet AI Knowledge Graph Factory ☆29 Updated 2 years ago
- Welcome to Snowman App – a Data Matching Benchmark Platform. ☆38 Updated 2 years ago
- ☆11 Updated 7 years ago
- A Tree Search Library for Data Cleaning ☆22 Updated 3 years ago
- Pattern-based table discovery in Open Data CSV files ☆25 Updated 2 years ago
- ☆17 Updated last week
- ☆26 Updated 3 years ago
- Graph Engine for Exploration and Search ☆40 Updated last year
- deep entity resolution lite version ☆11 Updated 5 years ago
- A Generalized Data Cleaning System ☆49 Updated 8 years ago
- Trying to generate name synonyms from wikidata ☆32 Updated 4 years ago
- Inspect ML Pipelines in Python in the form of a DAG ☆70 Updated last year
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks. ☆21 Updated 4 years ago
- Scalable String Similarity Joins in Python ☆39 Updated 9 months ago
- Knowledge base construction from raw scientific documents ☆38 Updated 4 months ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ☆48 Updated last year
- Build Neo4j graphs from Datashare projects ☆12 Updated last week
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine. ☆23 Updated 9 years ago