mitdbg/aurum-datadiscovery

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mitdbg/aurum-datadiscovery)

mitdbg / aurum-datadiscovery

☆78

Alternatives and similar repositories for aurum-datadiscovery

Users that are interested in aurum-datadiscovery are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

alex-bogatu / d3l
View on GitHub
D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf
☆21Nov 18, 2021Updated 4 years ago
mitdbg / lazo
View on GitHub
Sketch and LSH Index library for Java, including OPH methods as well as the Lazo method
☆15Dec 24, 2023Updated 2 years ago
olehmberg / WebTableStitching
View on GitHub
☆11Jul 21, 2017Updated 9 years ago
iai-group / www2018-table
View on GitHub
☆22Jan 3, 2023Updated 3 years ago
olehmberg / T2KMatch
View on GitHub
T2K Match is a matching algorithm optimised to match millions of web tables to a central knowledge base.
☆21May 5, 2018Updated 8 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
VIDA-NYU / auctus
View on GitHub
Mirror from: https://gitlab.com/ViDA-NYU/auctus/auctus
☆44May 12, 2025Updated last year
slipguru / adenine
View on GitHub
ADENINE: A Data ExploratioN PipelINE
☆15Jul 30, 2018Updated 7 years ago
NajiElKotob / Awesome-PowerQuery
View on GitHub
Awesome Power Query
☆14Updated this week
Atidot / language-powerquery
View on GitHub
PowerQuery (M Language) AST and Parser in Haskell
☆11Aug 6, 2020Updated 5 years ago
anhaidgroup / sparkly
View on GitHub
☆19Apr 27, 2026Updated 2 months ago
olehmberg / winter
View on GitHub
WInte.r is a Java framework for end-to-end data integration. The WInte.r framework implements well-known methods for data pre-processing,…
☆114May 20, 2022Updated 4 years ago
slp3-chinese / slp3-chinese
View on GitHub
Speech and Language Processing 中文翻译
☆21May 18, 2019Updated 7 years ago
IDEBench / IDEBench-public
View on GitHub
☆22Jun 10, 2020Updated 6 years ago
superctj / observatory
View on GitHub
Characterization of relational table embeddings (VLDB 2024).
☆32Jul 1, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
cchristodoulaki / Pytheas
View on GitHub
Pattern-based table discovery in Open Data CSV files
☆25Dec 8, 2022Updated 3 years ago
tdoehmen / hypoparsr
View on GitHub
☆27Jan 31, 2019Updated 7 years ago
xinglin / tpch
View on GitHub
TPC-H benchmark, specific for mysql
☆25Apr 18, 2013Updated 13 years ago
suhailshergill / TTFI
View on GitHub
typed tagless final interpreters
☆13Feb 14, 2017Updated 9 years ago
PKU-ICST-MIPL / MKVSE-TOMM2023
View on GitHub
☆28May 16, 2023Updated 3 years ago
tomchance / OpenEcoMaps
View on GitHub
Eco-living maps and data based on OpenStreetMap
☆17Jul 12, 2013Updated 13 years ago
vered1986 / UnsupervisedHypernymy
View on GitHub
Data and code for the experiments in: "Hypernyms under Siege: Linguistically-motivated Artillery for Hypernymy Detection". Vered Shwartz,…
☆51Jun 26, 2018Updated 8 years ago
fireindark707 / Python-Schema-Matching
View on GitHub
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
☆42Mar 8, 2026Updated 4 months ago
vraj-ucsd / ML-Data-Prep-Zoo
View on GitHub
☆31Nov 10, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ruc-datalab / Unicorn
View on GitHub
☆32Apr 15, 2023Updated 3 years ago
qcri / DeepBlocker
View on GitHub
Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …
☆30Apr 5, 2023Updated 3 years ago
UKPLab / SciGen
View on GitHub
☆21Jan 18, 2022Updated 4 years ago
david-abel / rl_info_theory
View on GitHub
A collection of code investigating the use of information theory for abstractions in RL
☆16Nov 14, 2018Updated 7 years ago
runawayhorse001 / PyAudit
View on GitHub
Python Data Audit
☆12Jul 24, 2020Updated 5 years ago
xllora / bwdrivers
View on GitHub
Storage driver implementations for BadWolf
☆16Sep 18, 2018Updated 7 years ago
zzh-SJTU / E5-Hierarchical-Table-Analysis
View on GitHub
The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …
☆15Jun 23, 2024Updated 2 years ago
cperales / uci-download-process
View on GitHub
Python scripts for downloading and converting UCI data sets
☆10Nov 19, 2024Updated last year
meelgroup / MLIC
View on GitHub
A new framework to generate interpretable classification rules
☆18Feb 11, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LuYanFCP / PocketFlow-rs
View on GitHub
Move to https://github.com/The-Pocket/PocketFlow-Rust
☆10May 7, 2025Updated last year
iai-group / nordlys
View on GitHub
Nordlys: Toolkit for entity-oriented and semantic search
☆31Mar 23, 2021Updated 5 years ago
questcollector / autogen-kubernetes
View on GitHub
support kubernetes feature for autogen(https://github.com/microsoft/autogen)
☆11Sep 15, 2025Updated 10 months ago
HPI-Information-Systems / Metanome
View on GitHub
The source repository of the Metanome tool
☆192Jun 5, 2025Updated last year
petukhovv / tree2vec
View on GitHub
AST factorization: transformation AST of Kotlin source code to a vector
☆11Oct 17, 2019Updated 6 years ago
iai-group / table-retrieval
View on GitHub
☆11Jan 3, 2023Updated 3 years ago
scravy / pysparkextra
View on GitHub
☆10Jun 29, 2021Updated 5 years ago