microsoft / Table-Pretraining
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
☆288Updated last year
Related projects: ⓘ
- [EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models☆546Updated last year
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆219Updated last year
- Scalable training for dense retrieval models.☆268Updated last year
- Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…☆508Updated 9 months ago
- Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning☆658Updated last year
- Fusion-in-Decoder☆548Updated 11 months ago
- Dataset and code for EMNLP2020 paper "HybridQA: A Dataset of Multi-Hop Question Answeringover Tabular and Textual Data"☆217Updated last year
- ☆259Updated 9 months ago
- Semantic Evaluation for Text-to-SQL with Distilled Test Suites☆209Updated 3 months ago
- Build Text Rerankers with Deep Language Models☆245Updated 7 months ago
- ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels …☆235Updated last year
- A Survey of Attributions for Large Language Models☆155Updated 3 weeks ago
- Code used for sourcing and cleaning the BigScience ROOTS corpus☆299Updated last year
- [NeurIPS'22 Spotlight] A Contrastive Framework for Neural Text Generation☆461Updated 6 months ago
- Prod Env☆375Updated 11 months ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆442Updated last month
- TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training☆96Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.☆386Updated 7 months ago
- A dataset of complex questions on semi-structured Wikipedia tables☆145Updated 3 years ago
- [ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"☆295Updated last year
- PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project tha…☆339Updated 11 months ago
- Tevatron - A flexible toolkit for neural retrieval research and development.☆479Updated last month
- Repository for EMNLP 2022 Paper: Towards a Unified Multi-Dimensional Evaluator for Text Generation☆187Updated 7 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆605Updated 2 years ago
- The official implementation of the paper "RASAT: Integrating Relational Structures into Pretrained Seq2Seq Model for Text-to-SQL"(EMNLP 2…☆62Updated last year
- Code and data for "TURL: Table Understanding through Representation Learning"☆115Updated 2 years ago
- An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi☆249Updated last year
- DSIR large-scale data selection framework for language model training☆221Updated 5 months ago
- ☆292Updated last year
- ☆246Updated 9 months ago