EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Routing-by-Agreement
☆21Oct 9, 2020Updated 5 years ago
Alternatives and similar repositories for Diversify-MHA
Users that are interested in Diversify-MHA are comparing it to the libraries listed below
Sorting:
- Code for the SIGIR 2020 paper "A Unified Dual-view Model for Review Summarization and Sentiment Classification with Inconsistency Loss"☆21Feb 3, 2021Updated 5 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations☆30Jun 12, 2023Updated 2 years ago
- Computing calibrated prediction intervals for neural network regressors☆10May 28, 2019Updated 6 years ago
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha…☆11Apr 16, 2021Updated 4 years ago
- ☆12Jun 29, 2025Updated 8 months ago
- ☆15Jan 24, 2019Updated 7 years ago
- VS Code tools for NextBASIC☆12Apr 22, 2025Updated 10 months ago
- ☆12Feb 22, 2021Updated 5 years ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Explore various machine learning techniques to do time series prediction.☆11Apr 13, 2019Updated 6 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago
- A collection of Z80 assembler related projects developed as both an educational and personal resource.☆14May 16, 2018Updated 7 years ago
- Predicting Unplanned Hospital Readmission Using Natural Language Processing of MIMICIII Discharge Notes☆12Feb 12, 2019Updated 7 years ago
- Numpy implementation of Gaussian Process Regression☆11May 27, 2019Updated 6 years ago
- Simple Pygame application for Particle Effect showcasing (Tutorial)☆10Nov 11, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated last year
- ☆10Nov 15, 2021Updated 4 years ago
- Word Familiarity Rate for 'Word List by Semantic Principles (WLSP)'☆12Jan 2, 2025Updated last year
- Minimalist Operating System designed to implement as much functionality as possible with a budget of 1000 Lines of Code☆12Sep 28, 2016Updated 9 years ago
- A 3D library for the ZX Spectrum Next☆27Nov 30, 2025Updated 3 months ago
- Homework questions from the Coursera/Stanford course Mining Massibve Datasets. Question, no answers.☆11Nov 22, 2014Updated 11 years ago
- text to speech☆10Mar 19, 2024Updated last year
- CNN Image Retrieval Model Weights Ported☆12Jun 2, 2018Updated 7 years ago
- ☆11May 26, 2020Updated 5 years ago
- ☆10Mar 14, 2021Updated 4 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- A straightforward implementation for Progressive Growing of GANs☆10Jun 20, 2018Updated 7 years ago
- normalizer of numerical / temporal expression☆11Sep 2, 2018Updated 7 years ago
- code for manuscript "Synthesizing CT Images from MR Images with Deep Learning: Model Generalization for Different Datasets through Transf…☆13Apr 23, 2021Updated 4 years ago
- Video Audio Translation Tool - automatically subtitles and dubs videos☆13Mar 16, 2020Updated 5 years ago
- notes on reading tensorflow source code☆13Aug 18, 2018Updated 7 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- Zx Spectrum Uart comms tools.☆12Apr 14, 2020Updated 5 years ago
- ☆13Feb 28, 2024Updated 2 years ago
- ☆13Nov 16, 2020Updated 5 years ago