Adaptive embedding and softmax
☆17Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-adaptive-softmax
Users that are interested in keras-adaptive-softmax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transformer-XL with checkpoint loader☆67Jan 22, 2022Updated 4 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Tensorflow NCE loss in Keras☆34Oct 6, 2018Updated 7 years ago
- Ordered Neurons LSTM☆30Jan 22, 2022Updated 4 years ago
- a language model with gated conv nets (implements https://arxiv.org/pdf/1612.08083v1.pdf)☆16Jan 5, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- Transformer implemented in Keras☆368Jan 22, 2022Updated 4 years ago
- Pytorch implementation of Dauphin et al. (2016) "Language Modeling with Gated Convolutional Networks"☆29Jan 10, 2023Updated 3 years ago
- Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System☆18Sep 7, 2019Updated 6 years ago
- Load GPT-2 checkpoint and generate texts☆127Jan 22, 2022Updated 4 years ago
- A list of all papers related to anomaly detection in NeurIPS 2020.☆10Jan 13, 2021Updated 5 years ago
- Learning rate multiplier☆46Jun 22, 2021Updated 4 years ago
- Keras implementation of AdaBound☆130Nov 4, 2019Updated 6 years ago
- Using Spatial Transformer Layer with keras (theano backend).☆12Jun 7, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Training RNNs as fast as CNNs. An unofficial tensorflow implementation.☆33Feb 23, 2018Updated 8 years ago
- A package of Wide Residual Networks for image recognition in Keras.☆15Jan 22, 2022Updated 4 years ago
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow.☆99Oct 17, 2018Updated 7 years ago
- Tencent_AILab_ChineseEmbedding☆12Dec 30, 2018Updated 7 years ago
- Cognitive Computational Neuroscience online Reading Club(CCN0RC)☆12Jul 30, 2021Updated 4 years ago
- Teaching materials for BayesCog workshop, UKE Hamburg (Part 1).☆15Dec 4, 2023Updated 2 years ago
- Generative Adversarial Network with Weight Normalization + ResNet☆22Dec 18, 2017Updated 8 years ago
- ☆10Apr 5, 2022Updated 4 years ago
- Implementation of XLNet that can load pretrained checkpoints☆169Jan 22, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Oct 19, 2022Updated 3 years ago
- Unbounded cache model for online language modeling with open vocabulary☆11Feb 15, 2019Updated 7 years ago
- lookahead optimizer for keras☆169Oct 14, 2019Updated 6 years ago
- Using GPT2 to build a human motion model!☆18Jun 20, 2020Updated 5 years ago
- Deep neural network inference transpiler tool for tflite and NNAPI☆12Jul 16, 2018Updated 7 years ago
- Compositional Abstractions Tutorial☆14Nov 26, 2023Updated 2 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 3 years ago
- Word semantics Deep Learning with Vanilla Python, Keras, Theano, TensorFlow, PyTorch☆14Apr 25, 2017Updated 9 years ago
- The quantization of CNN/LSTM☆11Mar 26, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Apr 30, 2016Updated 10 years ago
- 提取出判决书中的金额项和金额数。☆11Apr 8, 2016Updated 10 years ago
- Keras implementation of the Information Dropout (arXiv:1611.01353) paper☆15Dec 31, 2016Updated 9 years ago
- Chatbot_CN项目的知识图谱模块☆12Mar 27, 2020Updated 6 years ago
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- Layer-wise Adaptive Moments optimizer for Batch training☆15Apr 3, 2019Updated 7 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis☆11Nov 14, 2024Updated last year