microsoft / Table-PretrainingLinks
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
☆296Updated 2 years ago
Alternatives and similar repositories for Table-Pretraining
Users that are interested in Table-Pretraining are comparing it to the libraries listed below
Sorting:
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆116Updated 7 months ago
- Scalable training for dense retrieval models.☆298Updated 2 weeks ago
- [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models☆558Updated last year
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆230Updated 2 years ago
- [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"☆318Updated last year
- Build Text Rerankers with Deep Language Models☆263Updated last year
- Code and Data for ICLR2021 Paper "Open Question Answering over Tables and Text"☆154Updated last year
- PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project tha…☆360Updated last year
- TAT-QA (Tabular And Textual dataset for Question Answering) contains 16,552 questions associated with 2,757 hybrid contexts from real-wor…☆111Updated 6 months ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723☆728Updated 2 years ago
- PromptBERT: Improving BERT Sentence Embeddings with Prompts☆337Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆471Updated last year
- [EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674☆196Updated 2 years ago
- Fusion-in-Decoder☆572Updated last year
- Dataset for TACL 2022 paper: "FeTaQA: Free-form Table Question Answering"☆81Updated 2 years ago
- Search Engines with Autoregressive Language models☆288Updated 2 years ago
- [ACL 2022] A hierarchical table dataset for question answering and data-to-text generation.☆90Updated 3 months ago
- A dataset of complex questions on semi-structured Wikipedia tables☆164Updated 4 years ago
- ☆183Updated 2 years ago
- GAP-text2SQL: Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training☆104Updated last year
- Code and data for "TURL: Table Understanding through Representation Learning"☆122Updated 2 years ago
- The official implementation of the paper "RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL"(EMNLP 2…☆65Updated 2 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆652Updated last week
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆539Updated last year
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆313Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆162Updated last year
- Materials for ACL-2022 tutorial: Knowledge-Augmented Methods for Natural Language Processing☆288Updated 2 years ago
- Data and Code for ICLR2020 Paper "TabFact: A Large-scale Dataset for Table-based Fact Verification"☆398Updated last year
- ☆283Updated last year
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆604Updated 3 years ago