☆78Mar 6, 2023Updated 3 years ago
Alternatives and similar repositories for aurum-datadiscovery
Users that are interested in aurum-datadiscovery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf☆21Nov 18, 2021Updated 4 years ago
- Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method☆15Dec 24, 2023Updated 2 years ago
- ☆11Jul 21, 2017Updated 8 years ago
- Code and Benchmarks for JOSIE (SIGMOD 2019)☆20Apr 13, 2023Updated 3 years ago
- A Jupyter notebook extension to centralize and manage data☆15Dec 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LSH index for approximate set containment search☆62Jun 27, 2022Updated 4 years ago
- ☆22Jan 3, 2023Updated 3 years ago
- T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.☆21May 5, 2018Updated 8 years ago
- ☆18Apr 27, 2026Updated 2 months ago
- Awesome Power Query☆14Jun 14, 2026Updated 2 weeks ago
- Implementation of algorithms for semantic table implementation, including the TableMiner+ method☆19Sep 1, 2022Updated 3 years ago
- PowerQuery (M Language) AST and Parser in Haskell☆11Aug 6, 2020Updated 5 years ago
- Set-oriented Operations in Pandas☆24May 27, 2020Updated 6 years ago
- WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…☆114May 20, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆22Jun 10, 2020Updated 6 years ago
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Apr 14, 2024Updated 2 years ago
- Characterization of relational table embeddings (VLDB 2024).☆32Jul 1, 2024Updated last year
- ☆27Jan 31, 2019Updated 7 years ago
- Efficient set similarity search algorithms implemented in Go☆35Aug 27, 2022Updated 3 years ago
- Examples to run Hadoop/Spark cluster with kubernetes.☆12Feb 10, 2019Updated 7 years ago
- Image Captioning: Implementing the Neural Image Caption Generator☆21Oct 14, 2020Updated 5 years ago
- simialrity join or search on spark core directly☆28Jul 23, 2020Updated 5 years ago
- A python tool using XGboost and sentence-transformers to perform schema matching task on tables.☆42Mar 8, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository provides the implementation of several well-know INDs discovery algorithms☆13Nov 5, 2019Updated 6 years ago
- Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …☆30Apr 5, 2023Updated 3 years ago
- ☆20Jan 18, 2022Updated 4 years ago
- An integration of KùzuDB and RDFlib.☆18Nov 15, 2024Updated last year
- TopK Algorithms Benchmark☆10Jul 16, 2019Updated 6 years ago
- Stratosphere is now Apache Flink.☆201Dec 16, 2023Updated 2 years ago
- ECNU ICA seminar materials☆14Nov 23, 2022Updated 3 years ago
- The source code of the Sudowoodo paper in ICDE 2023☆19May 24, 2023Updated 3 years ago
- Autonomous Agent for Kubernetes☆15Feb 14, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆15Jun 23, 2024Updated 2 years ago
- Python Data Audit☆12Jul 24, 2020Updated 5 years ago
- init☆13Feb 3, 2021Updated 5 years ago
- A new framework to generate interpretable classification rules☆18Feb 11, 2023Updated 3 years ago
- A fast and accurate index for distribution-aware dataset search.☆10Feb 3, 2026Updated 4 months ago
- Python scripts for downloading and converting UCI data sets☆10Nov 19, 2024Updated last year
- mechanical-elephant.com☆11Jan 29, 2016Updated 10 years ago