AshwinDeshpande96 / Hierarchical-Softmax
This is a scalable hierarchical softmax layer for neural networks with a large number of output classes.
☆19 · Updated 4 years ago
Alternatives and similar repositories for Hierarchical-Softmax:
Users who are interested in Hierarchical-Softmax are comparing it to the libraries listed below:
- PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" ☆98 · Updated 6 years ago
- PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification" ☆48 · Updated 5 years ago
- ECML 2019: Graph Neural Networks for Multi-Label Classification ☆90 · Updated 7 months ago
- PyTorch implementation for the paper Classification from Positive, Unlabeled and Biased Negative Data. ☆19 · Updated last year
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models" ☆72 · Updated 2 years ago
- [ACL‘20] Highway Transformer: A Gated Transformer. ☆32 · Updated 3 years ago
- Implementation of the "Poincare Glove: Hyperbolic word embeddings" paper ☆85 · Updated 4 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch ☆45 · Updated 3 years ago
- Code for NeurIPS 2019 paper "Hierarchical Optimal Transport for Document Representation" ☆54 · Updated 5 years ago
- Differentiable Product Quantization for End-to-End Embedding Compression. ☆59 · Updated 2 years ago
- Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces ☆58 · Updated 3 years ago
- Code for Attention Word Embeddings ☆20 · Updated 4 years ago
- Train poincare embedding using gensim ☆19 · Updated 6 years ago
- Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit". ☆64 · Updated 3 years ago
- Code for the paper Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification (EACL '21) ☆23 · Updated 3 years ago
- Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net… ☆49 · Updated last year
- Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020 ☆20 · Updated 5 years ago
- Language Model Baselines for PyTorch ☆42 · Updated 4 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp… ☆21 · Updated 3 years ago
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020) ☆61 · Updated 4 years ago
- Implementation of Mixout with PyTorch ☆74 · Updated 2 years ago
- Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments" ☆53 · Updated 2 years ago
- Codes for <Kernelized Bayesian Softmax for Text Generation> in NeurIPS 2019 ☆16 · Updated 5 years ago
- The implementation of multi-branch attentive Transformer (MAT). ☆33 · Updated 4 years ago
- Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data ☆35 · Updated 4 years ago
- PyTorch Implementation of Zero-shot User Intent Detection via Capsule Neural Networks ☆18 · Updated 5 years ago