Adaptive embedding and softmax
☆17 · Jan 22, 2022 · Updated 4 years ago
Alternatives and similar repositories for keras-adaptive-softmax
Users that are interested in keras-adaptive-softmax are comparing it to the libraries listed below.
- Transformer-XL with checkpoint loader ☆67 · Jan 22, 2022 · Updated 4 years ago
- Gradient accumulation for Keras ☆35 · Jun 27, 2021 · Updated 4 years ago
- Tensorflow NCE loss in Keras ☆34 · Oct 6, 2018 · Updated 7 years ago
- Ordered Neurons LSTM ☆30 · Jan 22, 2022 · Updated 4 years ago
- ☆11 · Sep 3, 2021 · Updated 4 years ago
- AdaBound optimizer in Keras ☆56 · Jul 11, 2020 · Updated 5 years ago
- Transformer implemented in Keras ☆369 · Jan 22, 2022 · Updated 4 years ago
- Load GPT-2 checkpoint and generate texts ☆127 · Jan 22, 2022 · Updated 4 years ago
- A list of all papers related to anomaly detection in NeurIPS 2020. ☆10 · Jan 13, 2021 · Updated 5 years ago
- Official library of images for the SIGIR 2019 Open-Source IR Replicability Challenge (OSIRRC 2019) ☆13 · Jul 7, 2019 · Updated 6 years ago
- Learning rate multiplier ☆46 · Jun 22, 2021 · Updated 4 years ago
- Sampling Matters in Deep Embedding Learning (ICCV'17) ☆16 · Oct 16, 2018 · Updated 7 years ago
- Keras implementation of AdaBound ☆130 · Nov 4, 2019 · Updated 6 years ago
- A package of Wide Residual Networks for image recognition in Keras. ☆15 · Jan 22, 2022 · Updated 4 years ago
- Training RNNs as fast as CNNs. An unofficial tensorflow implementation. ☆33 · Feb 23, 2018 · Updated 8 years ago
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow. ☆99 · Oct 17, 2018 · Updated 7 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D… ☆11 · Feb 14, 2023 · Updated 3 years ago
- Code for Interpretable Adversarial Perturbation in Input Embedding Space for Text, IJCAI 2018. ☆42 · Feb 27, 2020 · Updated 6 years ago
- Generative Adversarial Network with Weight Normalization + ResNet ☆22 · Dec 18, 2017 · Updated 8 years ago
- Implementation of XLNet that can load pretrained checkpoints ☆169 · Jan 22, 2022 · Updated 4 years ago
- A package to perform collaborative filtering on emotion datasets. ☆11 · Jan 8, 2024 · Updated 2 years ago
- ☆16 · Jul 25, 2024 · Updated last year
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models ☆541 · May 30, 2020 · Updated 5 years ago
- Unbounded cache model for online language modeling with open vocabulary ☆11 · Feb 15, 2019 · Updated 7 years ago
- lookahead optimizer for keras ☆169 · Oct 14, 2019 · Updated 6 years ago
- Using GPT2 to build a human motion model! ☆18 · Jun 20, 2020 · Updated 5 years ago
- Keras cropping layer implementation ☆13 · Aug 23, 2016 · Updated 9 years ago
- Compositional Abstractions Tutorial ☆13 · Nov 26, 2023 · Updated 2 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022]. ☆16 · May 22, 2022 · Updated 3 years ago
- Word semantics Deep Learning with Vanilla Python, Keras, Theano, TensorFlow, PyTorch ☆14 · Apr 25, 2017 · Updated 8 years ago
- The quantization of CNN/LSTM ☆11 · Mar 26, 2017 · Updated 9 years ago
- ☆11 · Apr 30, 2016 · Updated 9 years ago
- Extracts monetary items and amounts from court judgments. ☆11 · Apr 8, 2016 · Updated 9 years ago
- Slides from various talks I gave ☆18 · Oct 25, 2018 · Updated 7 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper ☆15 · Dec 31, 2016 · Updated 9 years ago
- parallel corpora for any languages supported by glosbe.com ☆10 · Feb 9, 2016 · Updated 10 years ago
- Benchmarks of the H2O Ensemble R interface (H2O 2.0). ☆14 · Nov 4, 2020 · Updated 5 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces. ☆13 · Sep 22, 2021 · Updated 4 years ago
- Layer-wise Adaptive Moments optimizer for Batch training ☆15 · Apr 3, 2019 · Updated 6 years ago