superctj/observatory

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/superctj/observatory)

superctj / observatory

Characterization of relational table embeddings (VLDB 2024).

☆32

Alternatives and similar repositories for observatory

Users that are interested in observatory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

madelonhulsebos / neural-table-representations-tutorial-2023
View on GitHub
Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-…
☆21Jun 29, 2023Updated 3 years ago
northeastern-datalab / santos
View on GitHub
Implementation of SANTOS: Relationship-based Semantic Table Union Search.
☆14Nov 21, 2023Updated 2 years ago
megagonlabs / starmie
View on GitHub
Resources for PVLDB 2023 submission
☆29Aug 28, 2024Updated last year
megagonlabs / sudowoodo
View on GitHub
The source code of the Sudowoodo paper in ICDE 2023
☆19May 24, 2023Updated 3 years ago
SuDIS-ZJU / nlcTables
View on GitHub
☆15Jan 27, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
megagonlabs / doduo
View on GitHub
Annotating Columns with Pre-trained Language Models
☆35Jun 10, 2022Updated 4 years ago
alex-bogatu / d3l
View on GitHub
D3L dataset discovery framework - an implementation of the ICDE 2020 paper with the same name: https://arxiv.org/pdf/2011.10427.pdf
☆21Nov 18, 2021Updated 4 years ago
penfever / ArcheType
View on GitHub
ArcheType uses LLMs to automatically assign custom labels to your tabular data
☆19May 21, 2025Updated last year
awslabs / hypergraph-tabular-lm
View on GitHub
☆35Sep 7, 2024Updated last year
fireindark707 / Python-Schema-Matching
View on GitHub
A python tool using XGboost and sentence-transformers to perform schema matching task on tables.
☆42Mar 8, 2026Updated 4 months ago
delftdata / valentine
View on GitHub
A tool facilitating matching columns across tabular datasets. It also serves as an experiment suite for state-of-the-art schema matching …
☆124May 15, 2026Updated 2 months ago
madelonhulsebos / gittables
View on GitHub
Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).
☆50Apr 7, 2026Updated 3 months ago
target-benchmark / target
View on GitHub
TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL
☆29Jul 14, 2025Updated last year
guenthermi / table-embeddings
View on GitHub
Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data
☆21Apr 14, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
megagonlabs / sato
View on GitHub
Code and data for Sato https://arxiv.org/abs/1911.06311.
☆118Feb 23, 2024Updated 2 years ago
ysunbp / RECA-paper
View on GitHub
Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework
☆12May 7, 2025Updated last year
tsegall / fta
View on GitHub
Metadata/data identification Java library. Identifies Semantic Type information (e.g. Gender, Age, Color, Country,...). Extensive country…
☆33Jun 12, 2026Updated last month
SemBench / SemBench
View on GitHub
Benchmarking Semantic Query Processing Engines
☆63Updated this week
YSU-Data-Lab / TPC-H-Skew
View on GitHub
TPC-H benchmark with skew factor enabled
☆20Apr 17, 2025Updated last year
megagonlabs / rotom
View on GitHub
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…
☆24May 31, 2022Updated 4 years ago
mitmedialab / sherlock-project
View on GitHub
This repository provides data and scripts to use Sherlock, a DL-based model for semantic data type detection: https://sherlock.media.mit.…
☆190Jul 30, 2024Updated last year
SuDIS-ZJU / rookies
View on GitHub
Rookie's guide
☆13Aug 10, 2024Updated last year
spapicchio / QATCH
View on GitHub
Official implementation of QATCH: Benchmarking SQL-centric tasks with Table Representation Learning Models on Your Data
☆33Jul 17, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
iai-group / table-retrieval
View on GitHub
☆11Jan 3, 2023Updated 3 years ago
SFIG611 / tabbie
View on GitHub
☆60Aug 17, 2022Updated 3 years ago
FeiWang96 / GTR
View on GitHub
[SIGIR 2021] Retrieving Complex Tables with Multi-Granular Graph Representation Learning.
☆47Sep 14, 2022Updated 3 years ago
ruc-datalab / Unicorn
View on GitHub
☆32Apr 15, 2023Updated 3 years ago
JZCS2018 / SMAT
View on GitHub
Model and datasets for schema matching
☆15Jul 17, 2021Updated 5 years ago
wbbeyourself / DTE
View on GitHub
Detect-Then-Explain Framework for Text-to-SQL task
☆10Dec 6, 2023Updated 2 years ago
Abonia1 / Fine-Tuning-LLMs-Key-Concepts-and-Terms
View on GitHub
Fine-tuning large language models (LLMs) is crucial for enhancing performance across domain-specific task applications. This comprehensiv…
☆13Sep 19, 2024Updated last year
twosixlabs / armory-example
View on GitHub
Example external repository for interacting with armory.
☆11May 2, 2022Updated 4 years ago
allenai / chime
View on GitHub
Repository containing dataset, models and code associated with the CHIME project
☆18Aug 22, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
bfetahu / wiki_tables
View on GitHub
init
☆13Feb 3, 2021Updated 5 years ago
UKPLab / SciGen
View on GitHub
☆21Jan 18, 2022Updated 4 years ago
cwida / pvldbstyle
View on GitHub
PVLDB LaTeX style, based on acmart
☆16Apr 29, 2021Updated 5 years ago
j-r77 / cfddiscovery
View on GitHub
☆11Oct 31, 2019Updated 6 years ago
OSU-NLP-Group / TableLlama
View on GitHub
[NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".
☆137May 14, 2024Updated 2 years ago
zzh-SJTU / E5-Hierarchical-Table-Analysis
View on GitHub
The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …
☆15Jun 23, 2024Updated 2 years ago
DBGroup-SUSTech / multi-vector-retrieval
View on GitHub
☆15Apr 19, 2026Updated 3 months ago