microsoft / Table-Pretraining
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
☆290Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Table-Pretraining
- [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models☆550Updated last year
- Scalable training for dense retrieval models.☆271Updated last year
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites☆230Updated 5 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆462Updated last month
- Fusion-in-Decoder☆549Updated last year
- ☆265Updated 11 months ago
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆223Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆518Updated 11 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆685Updated last year
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆98Updated this week
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆231Updated last year
- A Survey of Attributions for Large Language Models☆169Updated 2 months ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆530Updated last month
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆240Updated last year
- Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"☆153Updated 10 months ago
- Multiple paper open-source codes of the Microsoft Research Asia DKI group☆374Updated last year
- A dataset of complex questions on semi-structured Wikipedia tables☆154Updated 3 years ago
- Introduction page of a challenging text-to-SQL dataset: KaggleDBQA☆34Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆194Updated last year
- The Pytorch implementation of RESDSQL (AAAI 2023).☆243Updated 6 months ago
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆288Updated 2 years ago
- ☆202Updated last year
- [ACL 2021] This is the project containing source codes and pre-trained models about ACL2021 Long Paper ``LGESQL: Line Graph Enhanced Text…☆147Updated 2 years ago
- Codebase for RetroMAE and beyond.☆240Updated 5 months ago
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆333Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆80Updated last year
- ☆122Updated 2 months ago
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor…☆97Updated 2 months ago
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆193Updated 9 months ago
- Translating natural language questions to a structured query language☆223Updated last year