AshwinDeshpande96 / Hierarchical-SoftmaxLinks
This is a scalable hierarchical softmax layer for Neural Networks with large output classes.
☆22Updated 5 years ago
Alternatives and similar repositories for Hierarchical-Softmax
Users that are interested in Hierarchical-Softmax are comparing it to the libraries listed below
Sorting:
- ECML 2019: Graph Neural Networks for Multi-Label Classification☆91Updated last year
- PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"☆110Updated 7 years ago
- Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces☆58Updated 4 years ago
- Transformers are Graph Neural Networks!☆54Updated 4 years ago
- A smoother activation function (undergrad code)☆115Updated 5 years ago
- Checking the interpretability of attention on text classification models☆49Updated 6 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- Code for Multi-Head Attention: Collaborate Instead of Concatenate☆153Updated 2 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆132Updated 4 years ago
- Code for NeurIPS 2019 paper "Hierarchical Optimal Transport for Document Representation"☆54Updated 5 years ago
- Unsupervised Data Augmentation experiments in PyTorch☆59Updated 6 years ago
- Two-Layer Hierarchical Softmax Implementation for PyTorch☆70Updated 4 years ago
- ☆19Updated 4 years ago
- Topic clustering library built on Transformer embeddings and cosine similarity metrics.Compatible with all BERT base transformers from hu…☆43Updated 4 years ago
- Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"☆54Updated 3 years ago
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆95Updated 5 years ago
- Implementation of Mixout with PyTorch☆75Updated 2 years ago
- Uncertainty-aware Self-training☆121Updated 2 years ago
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- ECLARE: Extreme Classification with Label Graph Correlations☆42Updated 3 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Updated 4 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch☆46Updated 4 years ago
- PyTorch implementation of Lessons on Parameter Sharing across Layers in Transformers☆26Updated 4 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".☆66Updated 4 years ago
- An Inplementation of CRF (Conditional Random Fields) in PyTorch 1.0☆137Updated 5 years ago
- Transformers without Tears: Improving the Normalization of Self-Attention☆134Updated last year
- Multi-Task Deep Neural Networks for Natural Language Understanding☆166Updated 2 years ago
- Pytorch implementation of OpenAI-GPT for ROC stories☆51Updated 6 years ago
- Extremely simple and fast extreme multi-class and multi-label classifiers.☆70Updated last month
- Tutorial: implementing doc2vec (paragraph vectors) from scratch in PyTorch☆14Updated 6 years ago