HazyResearch / fm_data_tasks
Foundation Models for Data Tasks
☆105 Updated last year
Alternatives and similar repositories for fm_data_tasks:
Users who are interested in fm_data_tasks are comparing it to the libraries listed below.
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆43 Updated 3 years ago
- Characterization of relational table embeddings (VLDB 2024). ☆27 Updated 9 months ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆22 Updated 2 years ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL ☆21 Updated this week
- ☆30 Updated last year
- Resources for PVLDB 2023 submission ☆25 Updated 7 months ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering" ☆26 Updated 11 months ago
- ☆60 Updated 2 years ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models ☆46 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 Updated last year
- ☆28 Updated 2 years ago
- The source code of the Sudowoodo paper in ICDE 2023 ☆15 Updated last year
- ☆51 Updated 9 months ago
- The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023] ☆46 Updated 11 months ago
- ☆39 Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021 ☆19 Updated 3 years ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆94 Updated 2 years ago
- Code for paper Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding ☆63 Updated 9 months ago
- ☆29 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-… ☆20 Updated last year
- ☆69 Updated 10 months ago
- A repository to perform self-instruct with a model on HF Hub ☆32 Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness ☆100 Updated 2 months ago
- [SUKI'22] Table Retrieval May Not Necessitate Table-Specific Model Design ☆21 Updated 2 years ago
- Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval (NeurIPS'21) ☆44 Updated 3 years ago
- Retrieval as Attention ☆83 Updated 2 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance ☆17 Updated 2 years ago
- Code and data for "TURL: Table Understanding through Representation Learning" ☆121 Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 8 months ago