Ankur3107 / dpr-tf
Dense Passage Retrieval using tensorflow-keras on TPU
ā15Updated 3 years ago
Alternatives and similar repositories for dpr-tf:
Users that are interested in dpr-tf are comparing it to the libraries listed below
- Tutorial to pretrain & fine-tune a š¤ Flax T5 model on a TPUv3-8 with GCPā58Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.ā145Updated 3 years ago
- ā74Updated 3 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transferā39Updated 4 years ago
- Shared code for training sentence embeddings with Flax / JAXā27Updated 3 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer ā¦ā55Updated 4 years ago
- BERT models for many languages created from Wikipedia textsā34Updated 4 years ago
- ā21Updated 3 years ago
- Embedding Recycling for Language modelsā38Updated last year
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+ā37Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp modelsā45Updated 2 years ago
- Training T5 to perform numerical reasoning.ā23Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answeringā62Updated 3 years ago
- Execute arbitrary SQL queries on š¤ Datasetsā32Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pā¦ā34Updated last year
- A python library for highly configurable transformers - easing model architecture search and experimentation.ā49Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention networkā34Updated 2 years ago
- This repository contains example code to build models on TPUsā30Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.ā93Updated last year
- ā17Updated last year
- A š¤-style implementation of BERT using lambda layers instead of self-attentionā69Updated 4 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorchā72Updated 2 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementationā31Updated last year
- ā28Updated last year
- PyTorch implementation of GLOMā21Updated 2 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorchā75Updated 4 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"ā21Updated 4 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)ā19Updated 2 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/ā¦ā25Updated 9 months ago
- ā12Updated 2 years ago