SparseMax activation function implementation (ICML 2016) (PyTorch)
☆28Nov 30, 2017Updated 8 years ago
Alternatives and similar repositories for SparsemaxPytorch
Users that are interested in SparsemaxPytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14May 14, 2019Updated 6 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆12Jan 29, 2021Updated 5 years ago
- ☆16May 14, 2024Updated last year
- Implementation attempt of "From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification"☆54Dec 17, 2016Updated 9 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Sparse and structured neural attention mechanisms☆224Aug 31, 2020Updated 5 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- The code for the models described in "Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU" (KDD 2018).☆21May 22, 2020Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- PyTorch Implementation of Residual Attention Network for Semantic Segmentation.☆23Nov 1, 2017Updated 8 years ago
- Implementation for our paper exploring a novel 2D adaptive attention span kernel in computer vision.☆35Oct 3, 2023Updated 2 years ago
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆12Aug 12, 2021Updated 4 years ago
- Official PyTorch implementation of the paper : ProbAct: A Probabilistic Activation Function for Deep Neural Networks.☆13Jun 10, 2019Updated 6 years ago
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- For the paper: "Semi-Supervised Structured Prediction with Neural CRF Autoencoder"☆26Aug 7, 2017Updated 8 years ago
- conll-formatted-ontonotes-5.0 for chinese language☆11Jan 9, 2019Updated 7 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Oct 6, 2017Updated 8 years ago
- Source code for our journal submission : ELD-Net: An efficient deep learning architecture for accurate saliency detection☆10Nov 27, 2017Updated 8 years ago
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Jun 16, 2023Updated 2 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- Code for "A Neural Transition-based Model for Nested Mention Recognition"☆36Aug 21, 2018Updated 7 years ago
- Implementation of the paper: "High-dimensional Bayesian optimization using low-dimensional feature spaces".☆21Sep 11, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repository provides code source used in the paper: A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off☆13May 30, 2019Updated 6 years ago
- implementing Weight Agnostic Neural Networks to Spiking Neural Networks☆10Jan 26, 2021Updated 5 years ago
- This repository contains the code used for Ordered Memory paper☆29Jan 12, 2020Updated 6 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Dec 6, 2017Updated 8 years ago
- Distributed Tensorflow Implementation of Asynchronous DDPG☆12Oct 25, 2017Updated 8 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Keras-like APIs for JAX framework☆50Mar 25, 2023Updated 3 years ago
- Code & Experiments for "LILA: Language-Informed Latent Actions" to be presented at the Conference on Robot Learning (CoRL) 2021.☆13Nov 4, 2021Updated 4 years ago
- Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…☆17Sep 21, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase for "Decoding language spatial relations to 2D spatial arrangements" (Findings of EMNLP 2020).☆11Feb 10, 2023Updated 3 years ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- ☆17Oct 25, 2016Updated 9 years ago
- Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction"☆19Jan 28, 2021Updated 5 years ago
- HANA XS Advanced Python Buildpack and example multi-target-application (This Repository has been archived upon Members choice)☆10Feb 19, 2020Updated 6 years ago
- Tensorflow Implementation of adversarial learning based adversarial example generator☆10Jan 31, 2018Updated 8 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 5 years ago