Ankur3107 / dpr-tf
Dense Passage Retrieval using tensorflow-keras on TPU
β15Updated 3 years ago
Related projects β
Alternatives and complementary repositories for dpr-tf
- β20Updated 3 years ago
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- Training T5 to perform numerical reasoning.β23Updated 3 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.β145Updated 3 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answeringβ60Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/β¦β23Updated 6 months ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+β37Updated 3 years ago
- β29Updated 3 years ago
- Embedding Recycling for Language modelsβ38Updated last year
- A π€-style implementation of BERT using lambda layers instead of self-attentionβ70Updated 4 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"β20Updated 4 years ago
- β73Updated 3 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)β18Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β92Updated last year
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://aβ¦β46Updated 2 years ago
- Helper scripts and notes that were used while porting various nlp modelsβ44Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).β20Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)β46Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrievalβ27Updated 2 years ago
- β16Updated last year
- Shared code for training sentence embeddings with Flax / JAXβ27Updated 3 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering modelsβ57Updated last year
- Language-agnostic BERT Sentence Embedding (LaBSE) Pytorch Modelβ21Updated 4 years ago
- reference pytorch code for intent classificationβ45Updated 3 weeks ago
- BERT models for many languages created from Wikipedia textsβ34Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transforβ¦β47Updated last year
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorchβ75Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and β¦β49Updated 4 years ago
- β22Updated 2 years ago
- Open source library for few shot NLPβ77Updated last year