AshwinDeshpande96 / Hierarchical-Softmax

This is a scalable hierarchical softmax layer for Neural Networks with large output classes.

☆19

Alternatives and similar repositories for Hierarchical-Softmax:

Users that are interested in Hierarchical-Softmax are comparing it to the libraries listed below

dreamgonfly / BERT-pytorch
PyTorch implementation of BERT in "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding"
☆98Updated 6 years ago
bcol23 / HyperIM
PyTorch implementation of the paper "Hyperbolic Interaction Model For Hierarchical Multi-Label Classification"
☆48Updated 5 years ago
QData / LaMP
ECML 2019: Graph Neural Networks for Multi-Label Classification
☆90Updated 7 months ago
cyber-meow / PUbiasedN
PyTorch implementation for the paper Classification from Positive, Unlabeled and Biased Negative Data.
☆19Updated last year
10-zin / Synthesizer
A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models"
☆72Updated 2 years ago
cyk1337 / Highway-Transformer
[ACL‘20] Highway Transformer: A Gated Transformer.
☆32Updated 3 years ago
alex-tifrea / poincare_glove
Implementation of the "Poincare Glove: Hyperbolic word embeddings" paper
☆85Updated 4 years ago
lucidrains / coco-lm-pytorch
Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch
☆45Updated 3 years ago
IBM / HOTT
Code for NeurIPS 2019 paper "Hierarchical Optimal Transport for Document Representation"
☆54Updated 5 years ago
chentingpc / dpq_embedding_compression
Differentiable Product Quantization for End-to-End Embedding Compression.
☆59Updated 2 years ago
FranxYao / PoincareProbe
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
☆58Updated 3 years ago
luffycodes / attention-word-embedding
Code for Attention Word Embeddings
☆20Updated 4 years ago
harmanpreet93 / poincare-embedding-using-gensim
Train poincare embedding using gensim
☆19Updated 6 years ago
JetRunner / PABEE
Code for the paper "BERT Loses Patience: Fast and Robust Inference with Early Exit".
☆64Updated 3 years ago
soumyac1999 / hyperbolic-label-emb-for-hmc
Code for the paper Joint Learning of Hyperbolic Label Embeddings for Hierarchical Multi-label Classification (EACL '21)
☆23Updated 3 years ago
awasthiabhijeet / Learning-From-Rules
Implementation of experiments in paper "Learning from Rules Generalizing Labeled Exemplars" to appear in ICLR2020 (https://openreview.net…
☆49Updated last year
biswajitsc / sparse-embed
Code for paper 'Minimizing FLOPs to Learn Efficient Sparse Representations' published at ICLR 2020
☆20Updated 5 years ago
ChengyueGongR / advsoft
Language Model Baselines for PyTorch
☆42Updated 4 years ago
lxk00 / BERT-EMD
☆50Updated last year
carolinlawrence / gradient-rollback
Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…
☆21Updated 3 years ago
nng555 / ssmba
☆63Updated 2 years ago
yanzhangnlp / IS-BERT
An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020)
☆61Updated 4 years ago
bloodwass / mixout
Implementation of Mixout with PyTorch
☆74Updated 2 years ago
IKMLab / arct2
Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"
☆53Updated 2 years ago
eaglenlp / Text-Matching
☆24Updated 4 years ago
NingMiao / KerBS
Codes for <Kernelized Bayesian Softmax for Text Generation> in NeurIPS 2019
☆16Updated 5 years ago
HA-Transformer / MAT
The implementation of multi-branch attentive Transformer (MAT).
☆33Updated 4 years ago
Lingkai-Kong / Calibrated-BERT-Fine-Tuning
Code for Paper: Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data
☆35Updated 4 years ago
nhhoang96 / ZeroShotCapsule-PyTorch-
PyTorch Implementation of Zero-shot User Intent Detection via Capsule Neural Networks
☆18Updated 5 years ago
szhangtju / The-compression-of-Transformer
☆63Updated 4 years ago