Ankur3107 / dpr-tfLinks
Dense Passage Retrieval using tensorflow-keras on TPU
β15Updated 4 years ago
Alternatives and similar repositories for dpr-tf
Users that are interested in dpr-tf are comparing it to the libraries listed below
Sorting:
- Tutorial to pretrain & fine-tune a π€ Flax T5 model on a TPUv3-8 with GCPβ58Updated 2 years ago
- Shared code for training sentence embeddings with Flax / JAXβ27Updated 3 years ago
- Helper scripts and notes that were used while porting various nlp modelsβ46Updated 3 years ago
- Embedding Recycling for Language modelsβ38Updated last year
- β28Updated 2 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answeringβ63Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.β147Updated 3 years ago
- Ranking of fine-tuned HF models as base models.β35Updated last month
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+β38Updated 4 years ago
- Generate BERT vocabularies and pretraining examples from Wikipediasβ17Updated 5 years ago
- KitanaQA: Adversarial training and data augmentation for neural question-answering modelsβ57Updated last year
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorchβ76Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β93Updated 2 years ago
- π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.β82Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and β¦β51Updated 6 months ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queriesβ19Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)β48Updated 3 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.β49Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flairβ24Updated 3 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"β21Updated 2 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"β21Updated 4 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer β¦β55Updated 4 years ago
- β12Updated 6 months ago
- β76Updated 3 years ago
- A π€-style implementation of BERT using lambda layers instead of self-attentionβ69Updated 4 years ago
- β21Updated 3 years ago
- Data programming by demonstration for information extraction and span annotationβ35Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention networkβ35Updated 2 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transforβ¦β47Updated 2 years ago
- ELECTRA MODEL NLPβ13Updated 5 years ago