☆78Mar 6, 2023Updated 3 years ago
Alternatives and similar repositories for aurum-datadiscovery
Users that are interested in aurum-datadiscovery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf☆21Nov 18, 2021Updated 4 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆15Dec 24, 2023Updated 2 years ago
- ☆11Jul 21, 2017Updated 8 years ago
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- LSH index for approximate set containment search☆61Jun 27, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆22Jan 3, 2023Updated 3 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21May 5, 2018Updated 7 years ago
- Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus☆44May 12, 2025Updated 10 months ago
- ☆18Mar 18, 2026Updated last week
- Implementation of algorithms for semantic table implementation, including the TableMiner+ method☆19Sep 1, 2022Updated 3 years ago
- PowerQuery (M Language) AST and Parser in Haskell☆11Aug 6, 2020Updated 5 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 5 years ago
- ☆22Jun 10, 2020Updated 5 years ago
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Apr 14, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A search engine for Open Data☆59Mar 15, 2023Updated 3 years ago
- Pattern-based table discovery in Open Data CSV files☆25Dec 8, 2022Updated 3 years ago
- TPC-H benchmark, specific for mysql☆25Apr 18, 2013Updated 12 years ago
- Image Captioning: Implementing the Neural Image Caption Generator☆21Oct 14, 2020Updated 5 years ago
- Data and code for the experiments in: "Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection". Vered Shwartz,…☆51Jun 26, 2018Updated 7 years ago
- simialrity join or search on spark core directly☆28Jul 23, 2020Updated 5 years ago
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆40Mar 8, 2026Updated 3 weeks ago
- This is the repo for Multi-level textual grounding☆34Jul 21, 2020Updated 5 years ago
- This repository provides the implementation of several well-know INDs discovery algorithms☆14Nov 5, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆31Nov 10, 2021Updated 4 years ago
- ☆32Apr 15, 2023Updated 2 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Nov 14, 2018Updated 7 years ago
- low level kernels to benchmark peak compute, cache bandwidth on various levels, memory bandwidth, and some basic compute routines☆10Jan 3, 2026Updated 2 months ago
- The source code of the Sudowoodo paper in ICDE 2023☆18May 24, 2023Updated 2 years ago
- Autonomous Agent for Kubernetes☆14Feb 14, 2025Updated last year
- A new framework to generate interpretable classification rules☆18Feb 11, 2023Updated 3 years ago
- ☆10Jun 28, 2025Updated 9 months ago
- support kubernetes feature for autogen(https://github.com/microsoft/autogen)☆11Sep 15, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- LaTex source and slides of my HKUST Mphil Thesis on "Mars: accelerating MapReduce with graphics processors"☆18Aug 29, 2012Updated 13 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- ☆11Jan 3, 2023Updated 3 years ago
- Interactive cleaning for Pandas DataFrames☆16Nov 29, 2019Updated 6 years ago
- Probabilistic data structures for processing very large datasets (MinHash, HyperLogLog)☆11Aug 20, 2015Updated 10 years ago
- Helm chart to deploy Kellnr on kubernetes☆15Mar 18, 2026Updated last week
- Manipulating semantic data within Python☆18Jan 14, 2025Updated last year