Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32 · Dec 16, 2020 · Updated 5 years ago
Alternatives and similar repositories for distilled-retriever-pytorch
Users that are interested in distilled-retriever-pytorch are comparing it to the libraries listed below.
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ☆76 · Jan 14, 2021 · Updated 5 years ago
- Implementation of Kronecker Attention in Pytorch ☆20 · Sep 12, 2020 · Updated 5 years ago
- ☆14 · Jun 29, 2024 · Updated last year
- Pytorch implementation of the hamburger module from the ICLR 2021 paper "Is Attention Better Than Matrix Decomposition" ☆99 · Jan 13, 2021 · Updated 5 years ago
- Game code and data for Fool Me Twice: Entailment from Wikipedia Gamification https://arxiv.org/abs/2104.04725 ☆31 · Apr 17, 2026 · Updated last week
- Improving Neural Text Generation with Reinforcement Learning ☆23 · Jan 13, 2021 · Updated 5 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models ☆30 · May 31, 2022 · Updated 3 years ago
- Graph neural network message passing reframed as a Transformer with local attention ☆70 · Dec 24, 2022 · Updated 3 years ago
- ☆18 · Nov 25, 2022 · Updated 3 years ago
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models ☆37 · Apr 24, 2022 · Updated 4 years ago
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation". ☆129 · Jun 30, 2021 · Updated 4 years ago
- A simple implementation of a deep linear Pytorch module ☆21 · Oct 16, 2020 · Updated 5 years ago
- A IO library for DotNet Framework ☆10 · Nov 27, 2022 · Updated 3 years ago
- Take control back! ☆18 · Jun 22, 2022 · Updated 3 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, … ☆18 · Dec 30, 2021 · Updated 4 years ago
- Code for the paper "Measuring Bias in Contextualized Word Representations" ☆35 · Jul 19, 2019 · Updated 6 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation ☆23 · May 1, 2022 · Updated 3 years ago
- Implementation of Lie Transformer, Equivariant Self-Attention, in Pytorch ☆97 · Feb 19, 2021 · Updated 5 years ago
- Adam with minor modifications which give significant improvement ☆19 · Aug 20, 2021 · Updated 4 years ago
- KuaiSearch PERKS ☆12 · Nov 16, 2021 · Updated 4 years ago
- Paper notes for Information Extraction, including Relation Extraction (RE), Named Entity Recognition (NER), Entity Linking (EL), Event Ex… ☆17 · Apr 1, 2021 · Updated 5 years ago
- High performance pytorch modules ☆18 · Jan 14, 2023 · Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 · Apr 6, 2022 · Updated 4 years ago
- ☆25 · Jul 11, 2024 · Updated last year
- Korean Parallel Corpus ☆11 · Nov 27, 2014 · Updated 11 years ago
- Starter repo for regl explorations ☆10 · May 26, 2017 · Updated 8 years ago
- ☆24 · Jun 14, 2019 · Updated 6 years ago
- Named Entity Recognition via Attention_based CNNs-BiLSTm-CRF ☆15 · Jun 27, 2018 · Updated 7 years ago
- Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper ☆155 · Apr 27, 2021 · Updated 5 years ago
- ☆67 · Oct 13, 2021 · Updated 4 years ago
- Implementation of "Arc-swift: A Novel Transition System for Dependency Parsing" ☆32 · Aug 21, 2018 · Updated 7 years ago
- ☆11 · Jan 3, 2023 · Updated 3 years ago
- Demo web server app that shows how BERT model trained on SQuAD dataset deals with the machine comprehension task. ☆10 · Dec 8, 2022 · Updated 3 years ago
- Development server for Metalsmith.io with LiveReload capabilities ☆16 · Feb 11, 2019 · Updated 7 years ago
- Implementation of the paper "Learning to Generate Questions by Learning What not to Generate" ☆32 · May 15, 2019 · Updated 6 years ago
- A repository for converting between CoQA, SQuAD2, and QuAC and visualizing the data. ☆24 · Dec 11, 2018 · Updated 7 years ago
- Code for "Dialogue State Induction Using Neural Latent Variable Models" ☆26 · Oct 9, 2020 · Updated 5 years ago
- Temporal augmentation with two-stream ConvNet features on human action recognition ☆18 · Apr 3, 2017 · Updated 9 years ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models" ☆12 · Jul 25, 2024 · Updated last year