HazyResearch / fm_data_tasks
Foundation Models for Data Tasks
☆102Updated last year
Alternatives and similar repositories for fm_data_tasks:
Users that are interested in fm_data_tasks are comparing it to the libraries listed below
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io).☆43Updated 3 years ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆21Updated 3 weeks ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness☆99Updated last month
- Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)☆84Updated last year
- ☆30Updated last year
- Characterization of relational table embeddings (VLDB 2024).☆25Updated 8 months ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…☆22Updated 2 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆41Updated 2 weeks ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering"☆25Updated 10 months ago
- Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)☆85Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Updated 8 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- ☆32Updated 2 weeks ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆44Updated 2 months ago
- Data Benchmarking☆19Updated 9 months ago
- ☆29Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆151Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- Retrieval as Attention☆83Updated 2 years ago
- Pretraining Efficiently on S2ORC!☆158Updated 4 months ago
- ☆27Updated 4 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆113Updated 3 months ago
- ☆49Updated last year
- ☆59Updated 2 years ago
- The dataset for the paper "Machamp: A Generalized Entity Matching Benchmark" published in CIKM 2021☆19Updated 3 years ago
- ☆49Updated 8 months ago
- This project studies the performance and robustness of language models and task-adaptation methods.