eggie5 / NCE-loss
TensorFlow NCE loss in Keras
☆34 Updated 6 years ago
Alternatives and similar repositories for NCE-loss:
Users who are interested in NCE-loss are comparing it to the libraries listed below:
- TensorFlow port of Single Headed Attention RNN ☆16 Updated 5 years ago
- PyTorch implementation of Dauphin et al. (2016) "Language Modeling with Gated Convolutional Networks" ☆29 Updated 2 years ago
- Adaptive embedding and softmax ☆17 Updated 3 years ago
- Keras implementation of “Gated Linear Unit” ☆23 Updated 11 months ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector. ☆37 Updated 2 years ago
- A Keras implementation of Adaptive Softmax ☆7 Updated 6 years ago
- Introduction Notebook to Extreme Multi-Label Classification problem (XML) ☆22 Updated 6 years ago
- Re-implementation of the Noise Contrastive Estimation algorithm for PyTorch, following "Noise-contrastive estimation: A new estimation pr… ☆45 Updated 5 years ago
- A TensorFlow implementation of Yin Wenpeng's recent TACL paper "Attentive Convolution" ☆33 Updated 6 years ago
- Quasi-RNN for language modeling ☆57 Updated 8 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation" ☆14 Updated 4 years ago
- WARP loss for PyTorch as described in the paper "WSABIE: Scaling Up To Large Vocabulary Image Annotation" ☆44 Updated 2 years ago
- Efficient Transformers for research, PyTorch and TensorFlow using Locality Sensitive Hashing ☆94 Updated 5 years ago
- Quasi-Recurrent Neural Network (QRNN) for TensorFlow ☆23 Updated 6 years ago
- Multiplicative LSTM for Recommendations ☆20 Updated 6 years ago
- Partial code and datasets for NeurIPS'19 "Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers" ☆19 Updated 5 years ago
- SNAIL Attention Block for Keras. ☆16 Updated 5 years ago
- Position embedding layers in Keras ☆58 Updated 3 years ago
- Experiments using feedforward networks with attention ☆47 Updated 8 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes" ☆75 Updated 6 years ago
- Attention-based sequence-to-sequence neural machine translation model built in Keras. ☆30 Updated 7 years ago
- Hierarchical convolutional attention networks for text classification ☆16 Updated 5 years ago
- Personalized Query Completion ☆27 Updated 4 years ago
- Hash Embedding code for the paper "Hash Embeddings for Efficient Word Representations" ☆42 Updated 7 years ago
- Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization ☆12 Updated 2 years ago
- An implementation of MixMatch with PyTorch ☆36 Updated 4 years ago
- Collection of TensorFlow Examples ☆37 Updated 6 years ago
- Interpretable Models for NLP using PyTorch ☆18 Updated 7 years ago