HazyResearch/fm_data_tasks

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HazyResearch/fm_data_tasks)

HazyResearch / fm_data_tasks

Foundation Models for Data Tasks

☆112

Alternatives and similar repositories for fm_data_tasks

Users that are interested in fm_data_tasks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EliasMei / IPM
View on GitHub
Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]
☆10Mar 13, 2022Updated 4 years ago
megagonlabs / rotom
View on GitHub
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…
☆24May 31, 2022Updated 4 years ago
megagonlabs / starmie
View on GitHub
Resources for PVLDB 2023 submission
☆29Aug 28, 2024Updated last year
wbsg-uni-mannheim / TabAnnGPT
View on GitHub
This repository contains code and data for reproducing the experiments of three papers that focus on two subtasks of table annotation: co…
☆12Mar 5, 2025Updated last year
qcri / DeepBlocker
View on GitHub
Repository for performing Blocking using Deep Learning based on the paper "Deep Learning for Blocking in Entity Matching: A Design Space …
☆30Apr 5, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
megagonlabs / machamp
View on GitHub
The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021
☆22Oct 18, 2021Updated 4 years ago
amazon-science / wikiwiki-dataset
View on GitHub
☆11May 11, 2022Updated 4 years ago
ekzhu / josie
View on GitHub
Code and Benchmarks for JOSIE (SIGMOD 2019)
☆20Apr 13, 2023Updated 3 years ago
zhisbug / ray-scalable-ml-design
View on GitHub
Some microbenchmarks and design docs before commencement
☆11Feb 1, 2021Updated 5 years ago
icip-cas / EntityMatcher
View on GitHub
☆18Jun 17, 2024Updated 2 years ago
cpitclaudel / dBoost
View on GitHub
☆18Dec 3, 2015Updated 10 years ago
ysunbp / RECA-paper
View on GitHub
Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework
☆12May 7, 2025Updated last year
dbunibas / BART
View on GitHub
The BART Project: Benchmarking Algorithms for (data) Repairing and Translation
☆43Nov 27, 2023Updated 2 years ago
wbsg-uni-mannheim / contrastive-product-matching
View on GitHub
This repository contains the code to reproduce the experiments of the poster "Supervised Contrastive Learning for Product Matching"
☆38Feb 11, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
anhaidgroup / py_entitymatching
View on GitHub
☆193May 29, 2024Updated 2 years ago
jacopotagliabue / foundation-models-for-dbt-entity-matching
View on GitHub
Playground for using large language models into the Modern Data Stack for entity matching
☆110Apr 1, 2023Updated 3 years ago
HazyResearch / ama_prompting
View on GitHub
Ask Me Anything language model prompting
☆548Jul 5, 2023Updated 3 years ago
LoveCatc / supervised-llm-uncertainty-estimation
View on GitHub
This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".
☆26Oct 21, 2024Updated last year
ZJU-DAILY / CollaborEM
View on GitHub
Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.
☆41Jul 12, 2022Updated 4 years ago
HazyResearch / TART
View on GitHub
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆201Jun 22, 2023Updated 3 years ago
bradleypallen / fb15k-akbc
View on GitHub
A set of Jupyter notebooks capturing an effort to apply Keras to the problem of automatic knowledge base construction.
☆11Aug 30, 2016Updated 9 years ago
vid-koci / KBCtransferlearning
View on GitHub
Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"
☆15Feb 21, 2024Updated 2 years ago
HazyResearch / numbskull
View on GitHub
Numba-based version of DimmWitted Gibbs sampler
☆47May 14, 2018Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
adefossez / dnn_theo_practice
View on GitHub
Repository for DNN training, theory to practice, part of the Large Scale Machine Learning class at Mines Paritech
☆11Mar 11, 2022Updated 4 years ago
wbsg-uni-mannheim / wdc-lspc-v2
View on GitHub
This repository contains code and data download scripts for the paper "Using schema.org annotations for training and maintaining product …
☆16Aug 29, 2023Updated 2 years ago
ArjitJ / DIAL
View on GitHub
Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"
☆17Dec 20, 2021Updated 4 years ago
LHRLAB / HAHE
View on GitHub
[ACL 2023] Official resources of "HAHE: Hierarchical Attention for Hyper-Relational Knowledge Graphs in Global and Local Level".
☆28Aug 18, 2025Updated 11 months ago
yeyupiaoling / Chinese-LLM-Chat
View on GitHub
大语言模型微调的项目，包含了使用QLora微调ChatGLM和LLama
☆29Jun 26, 2023Updated 3 years ago
HoloClean / HoloClean-Legacy-deprecated
View on GitHub
A Machine Learning System for Data Enrichment.
☆76Sep 15, 2018Updated 7 years ago
abachaa / MeQSum
View on GitHub
Dataset for medical question summarization introduced in the ACL 2019 paper "On the Summarization of Consumer Health Questions" (A. Ben A…
☆33May 13, 2026Updated 2 months ago
HazyResearch / flyingsquid
View on GitHub
More interactive weak supervision with FlyingSquid
☆315Sep 1, 2020Updated 5 years ago
chu-data-lab / zeroer
View on GitHub
Entity resolution using zero labeled examples
☆34Jun 29, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sameersingh / er-visualizer
View on GitHub
D3 and Play based visualization for entity-relation graphs, especially for NLP and information extraction
☆31Aug 6, 2015Updated 10 years ago
congy / AutoSuggest
View on GitHub
☆14Mar 13, 2021Updated 5 years ago
henryre / numba-plsa
View on GitHub
PLSA for sparse matrices implemented with Numba
☆11Oct 18, 2016Updated 9 years ago
oriyor / reasoning-on-cots
View on GitHub
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆97Jan 21, 2024Updated 2 years ago
intellectronica / battle-of-the-semantics
View on GitHub
GraphRag vs Embeddings
☆16Jul 14, 2024Updated 2 years ago
Newbeeer / orthogonal_classifier
View on GitHub
Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier"
☆35Jun 6, 2023Updated 3 years ago
buds-lab / build2vec-thermal-comfort
View on GitHub
code for Build2Vec 1.0 reproducibility
☆13Oct 28, 2021Updated 4 years ago