HazyResearch / fm_data_tasks
Foundation Models for Data Tasks
☆102 Updated last year
Alternatives and similar repositories for fm_data_tasks:
Users who are interested in fm_data_tasks compare it to the libraries listed below.
- Code for extracting, parsing and annotating tables from GitTables (https://gittables.github.io). ☆42 Updated 3 years ago
- Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond… ☆22 Updated 2 years ago
- The code base for paper: "ReAcTable: Enhancing ReAct for Table Question Answering" ☆18 Updated 8 months ago
- Characterization of relational table embeddings (VLDB 2024). ☆25 Updated 6 months ago
- ☆30 Updated last year
- Resources for PVLDB 2023 submission ☆24 Updated 4 months ago
- ☆35 Updated last year
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible. ☆38 Updated 8 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆142 Updated 8 months ago
- Retrieval as Attention ☆83 Updated 2 years ago
- ☆27 Updated 2 months ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL ☆17 Updated this week
- Retrieval-Augmented Generation battle! ☆48 Updated last month
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆45 Updated last year
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data ☆18 Updated 9 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science ☆22 Updated 2 months ago
- Repository with an overview of the tutorial on Models and Practice of Neural Table Representations and up to date material for the hands-… ☆20 Updated last year
- PASTA: Post-hoc Attention Steering for LLMs ☆109 Updated last month
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆148 Updated last year
- Benchmark Datasets for Set Similarity Search ☆12 Updated 5 years ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution ☆30 Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI ☆91 Updated last year
- ☆46 Updated 6 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆153 Updated last month
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness ☆97 Updated last year
- LangCode - Improving alignment and reasoning of large language models (LLMs) with natural language embedded program (NLEP). ☆42 Updated last year
- ☆27 Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆40 Updated 3 weeks ago