Foundation Models for Data Tasks
☆110May 15, 2023Updated 2 years ago
Alternatives and similar repositories for fm_data_tasks
Users that are interested in fm_data_tasks are comparing it to the libraries listed below
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆23May 31, 2022Updated 3 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆21Oct 18, 2021Updated 4 years ago
- ☆15Mar 6, 2025Updated 11 months ago
- The BART Project: Benchmarking Algorithms for (data) Repairing and Translation☆42Nov 27, 2023Updated 2 years ago
- ☆32Apr 15, 2023Updated 2 years ago
- ☆18Jun 17, 2024Updated last year
- 📰 Computing the information content of trained neural networks☆22Oct 8, 2021Updated 4 years ago
- code for Build2Vec 1.0 reproducibility☆12Oct 28, 2021Updated 4 years ago
- Code for the paper "CollaborEM: A Self-supervised Entity Matching Framework Using Multi-features Collaboration". TKDE 2021.☆41Jul 12, 2022Updated 3 years ago
- This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"☆24Feb 9, 2023Updated 3 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 2 years ago
- Playground for integrating large language models into the Modern Data Stack for entity matching☆108Apr 1, 2023Updated 2 years ago
- Prompt programming with FMs.☆445Jul 22, 2024Updated last year
- Code for the paper "Deep Entity Matching with Pre-trained Language Models"☆304Apr 17, 2024Updated last year
- ☆14Nov 26, 2022Updated 3 years ago
- Extended Annotations of DeepFashion Images for Fine-grained Recognition☆14May 28, 2019Updated 6 years ago
- Authors' implementation of the paper "Equivariant Networks for Pixelized Spheres" published at ICML 2021.☆12Feb 23, 2022Updated 4 years ago
- An uncertainty-based random sampling algorithm for data augmentation☆30Sep 7, 2020Updated 5 years ago
- Continuous Benchmark of Filtering methods for Entity Resolution☆11Jul 20, 2025Updated 7 months ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- Code examples for "Under the hood of calling C/C++ from Python"☆13Sep 16, 2020Updated 5 years ago
- ☆14Mar 18, 2022Updated 3 years ago
- ☆13Feb 25, 2022Updated 4 years ago
- Deadline-based hyperparameter tuning on RayTune.☆32Jan 16, 2020Updated 6 years ago
- Clinical Text Mining☆12Aug 15, 2017Updated 8 years ago
- Simulations for predictive model selection in causal inference☆13Jan 16, 2025Updated last year
- Repository for DNN training, theory to practice, part of the Large Scale Machine Learning class at Mines ParisTech☆12Mar 11, 2022Updated 3 years ago
- Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework☆12May 7, 2025Updated 9 months ago
- ☆11May 11, 2022Updated 3 years ago
- Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier"☆35Jun 6, 2023Updated 2 years ago
- Simple dataset to dataloader library for pytorch☆32Jan 3, 2025Updated last year
- This repository contains code and extensive prompt examples to reproduce and extend the experiments in our papers "Using ChatGPT for Enti…☆65Oct 18, 2024Updated last year
- Welcome to Snowman App – a Data Matching Benchmark Platform.☆38Feb 9, 2023Updated 3 years ago
- DeepDive Biomedical Tools☆15Apr 3, 2017Updated 8 years ago
- params_proto, a collection of decorators that makes shell argument passing declarative☆19Feb 5, 2026Updated 3 weeks ago
- ☆14Mar 13, 2021Updated 4 years ago
- The opinionated machine learning experimentation framework☆13Jun 16, 2021Updated 4 years ago
- A collection of simple tutorials for using Fonduer☆101Oct 27, 2020Updated 5 years ago