eggie5 / NCE-lossLinks
Tensorflow NCE loss in Keras
☆34Updated 7 years ago
Alternatives and similar repositories for NCE-loss
Users that are interested in NCE-loss are comparing it to the libraries listed below
Sorting:
- Efficient Transformers for research, PyTorch and Tensorflow using Locality Sensitive Hashing☆96Updated 6 years ago
- SNAIL Attention Block for Keras.☆17Updated 5 years ago
- Pytorch implementation of Dauphin et al. (2016) "Language Modeling with Gated Convolutional Networks"☆29Updated 3 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Updated 6 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆38Updated 3 years ago
- Tutorial for Multi-Stakeholder Recommender Systems☆22Updated 4 years ago
- Keras implementation of “Gated Linear Unit ”☆23Updated last year
- Hash Embedding code for the paper "Hash Embeddings for Efficient Word Representations"☆42Updated 8 years ago
- ☆40Updated 7 years ago
- Machine-generated summaries and highlights of the every accepted paper at Thirty-second Conference on Neural Information Processing Syste…☆71Updated 7 years ago
- hierarchical convolutional attention networks for text classification☆16Updated 6 years ago
- Storage for Kaggle Quora competition☆16Updated 8 years ago
- kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting☆16Updated 8 years ago
- Large Scale BERT Distillation☆33Updated 2 years ago
- LSTM and Hierarchical Attention Network on DSVM☆41Updated 8 years ago
- Attention based sequence to sequence neural machine translation model built in keras.☆30Updated 7 years ago
- Personalized Query Completion☆27Updated 5 years ago
- Sentiment analysis with variable length sequences in pytorch☆34Updated 6 years ago
- Reproducing Character-Level-Language-Modeling with Deeper Self-Attention in PyTorch☆62Updated 7 years ago
- Density Order Embeddings☆33Updated 6 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- Adaptive embedding and softmax☆17Updated 4 years ago
- Sequence to Sequence Models in PyTorch☆44Updated last year
- Code repo for "Transformer on a Diet" paper☆31Updated 5 years ago
- Implementing Skip-gram Negative Sampling with pytorch☆49Updated 7 years ago
- Augmenting word embeddings with their surrounding context using bidirectional RNN☆60Updated 5 years ago
- Mixture of experts layers for Keras☆94Updated 7 years ago
- ☆15Updated 5 years ago
- Discover relevant information about categorical data with entity embeddings using Neural Networks (powered by Keras)☆70Updated 3 years ago
- Quasi-RNN for language modeling☆57Updated 9 years ago