EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Routing-by-Agreement
☆21Oct 9, 2020Updated 5 years ago
Alternatives and similar repositories for Diversify-MHA
Users that are interested in Diversify-MHA are comparing it to the libraries listed below.
- [EMNLP2019] Rethinking Attribute Representation and Injection for Sentiment Classification☆22Jan 2, 2020Updated 6 years ago
- Code for the SIGIR 2020 paper "A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss"☆21Feb 3, 2021Updated 5 years ago
- An implementation of a Capsule Attention Network.☆10Jan 26, 2018Updated 8 years ago
- Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes☆12Feb 12, 2019Updated 7 years ago
- Language Models as Semantic Indexers (ICML 2024)☆40May 2, 2024Updated last year
- Code for our paper: "Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers".☆20Dec 28, 2021Updated 4 years ago
- Readmission Prediction via Deep Contextual Embedding of Clinical Concepts☆18Dec 23, 2017Updated 8 years ago
- Official code and data for EMNLP 2020 paper "Cross-Media Keyphrase Prediction: A Unified Framework with Multi-Modality Multi-Head Attenti…☆21Nov 27, 2020Updated 5 years ago
- Randomly scrapes words from classical Chinese poetry and prose to use as git commit messages☆11Jan 16, 2017Updated 9 years ago
- Predictive Modeling in Urgent Care - A Comparative Study of Machine Learning Approaches☆22Mar 14, 2019Updated 7 years ago
- ☆15Aug 13, 2020Updated 5 years ago
- Source code and dataset for TKDE'22 paper "Region or Global? A Principle for Negative Sampling in Graph-based Recommendation"☆13Mar 15, 2022Updated 4 years ago
- ☆11Oct 15, 2020Updated 5 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- ☆11Jun 21, 2022Updated 3 years ago
- ☆13Dec 18, 2023Updated 2 years ago
- A conda-smithy repository for jaxlib.☆17Mar 26, 2026Updated 2 weeks ago
- Multilingual hierarchical attention networks toolkit☆78Nov 27, 2019Updated 6 years ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- This is the source code for the paper 'Analysis and Prediction of Unplanned Intensive Care Unit Readmission' published in PLoS ONE July☆27Dec 9, 2018Updated 7 years ago
- Non-invasive wearable circadian rhythm telemonitoring sensors☆17Apr 16, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated 2 years ago
- PyTorch implementation of https://arxiv.org/abs/1711.02536☆12Jan 11, 2018Updated 8 years ago
- Official Repo for Splitting Steepest Descent for Growing Neural Architectures☆13May 12, 2021Updated 4 years ago
- Kaggle: Quora Insincere Questions Classification - detect toxic content to improve online conversations☆36Dec 23, 2018Updated 7 years ago
- running LayoutLMv2☆11Apr 27, 2022Updated 3 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- mixed membership stochastic block model☆13Jun 8, 2016Updated 9 years ago
- Deep Generative Models (Chainer)☆10Oct 12, 2017Updated 8 years ago
- the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering☆13Aug 22, 2023Updated 2 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆71Mar 19, 2021Updated 5 years ago
- [EMNLP 2022] This is the code repo for our EMNLP'22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…☆50Oct 12, 2023Updated 2 years ago
- Comparison between ShuffleNet V2 and MnasNet (unofficial implementation in PyTorch)☆12Oct 25, 2018Updated 7 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- ☆12Feb 22, 2021Updated 5 years ago
- ☆11Jul 31, 2022Updated 3 years ago
- Hydrogen Community Edition☆19Jan 4, 2023Updated 3 years ago
- TensorFlow code and pre-trained models for BERT☆11May 2, 2019Updated 6 years ago