SparseMax activation function implementation (ICML 2016) (PyTorch)
☆28Nov 30, 2017Updated 8 years ago
Alternatives and similar repositories for SparsemaxPytorch
Users that are interested in SparsemaxPytorch are comparing it to the libraries listed below
Sorting:
- Implementation of Sparsemax activation in Pytorch☆165May 27, 2020Updated 5 years ago
- ☆19Sep 29, 2019Updated 6 years ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆16May 14, 2024Updated last year
- Implementation attempt of "From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification"☆54Dec 17, 2016Updated 9 years ago
- Sparse and structured neural attention mechanisms☆225Aug 31, 2020Updated 5 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- The code for the models described in "Learning Tasks for Multitask Learning: Heterogenous Patient Populations in the ICU" (KDD 2018).☆21May 22, 2020Updated 5 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Feb 12, 2018Updated 8 years ago
- Predicting Biomedical Interactions with Higher-Order Graph Convolutional Networks☆16Nov 9, 2021Updated 4 years ago
- PyTorch Implementation of Residual Attention Network for Semantic Segmentation.☆23Nov 1, 2017Updated 8 years ago
- Official PyTorch implementation of the paper : ProbAct: A Probabilistic Activation Function for Deep Neural Networks.☆13Jun 10, 2019Updated 6 years ago
- Official Repository for Efficient Linear-Time Attention Transformers.☆18Jun 2, 2024Updated last year
- Implementation of "Multi-modal Retrieval Augmented Multi-modal Generation: Datasets, Evaluation Metrics and Strong Baselines"☆31Feb 24, 2025Updated last year
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- conll-formatted-ontonotes-5.0 for chinese language☆11Jan 9, 2019Updated 7 years ago
- DyNet implementation of stack LSTM experiments by Grefenstette et al.☆21Oct 6, 2017Updated 8 years ago
- Raw-to-End Name Entity Recognition in Social Media☆16Oct 16, 2019Updated 6 years ago
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Mar 24, 2023Updated 2 years ago
- python metric functions, such as MAP, NDCG, AUC...☆10Jul 25, 2014Updated 11 years ago
- Source code for our journal submission : ELD-Net: An efficient deep learning architecture for accurate saliency detection☆10Nov 27, 2017Updated 8 years ago
- Deep Supervised Graph Partitioning Model☆14Aug 3, 2021Updated 4 years ago
- Implementation of the paper: "High-dimensional Bayesian optimization using low-dimensional feature spaces".☆21Sep 11, 2020Updated 5 years ago
- This repository provides code source used in the paper: A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off☆13May 30, 2019Updated 6 years ago
- This repository contains the code used for Ordered Memory paper☆29Jan 12, 2020Updated 6 years ago
- Reversible Recurrent Neural Network Pytorch Implementation☆21Dec 6, 2017Updated 8 years ago
- Synthesizer Self-Attention is a very recent alternative to causal self-attention that has potential benefits by removing this dot product…☆14Dec 29, 2024Updated last year
- Keras-like APIs for JAX framework☆50Mar 25, 2023Updated 2 years ago
- Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…☆17Sep 21, 2019Updated 6 years ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆10Apr 26, 2024Updated last year
- ☆17Oct 25, 2016Updated 9 years ago
- Code for paper "Incorporating Multimodal Information in Open-Domain Web Keyphrase Extraction"☆19Jan 28, 2021Updated 5 years ago
- Code for "Salient Deconvolutional Networks, Aravindh Mahendran, Andrea Vedaldi, ECCV 2016"☆12Sep 28, 2016Updated 9 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- ☆19Jan 13, 2021Updated 5 years ago
- Code for the paper "Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation"☆10Feb 4, 2023Updated 3 years ago
- Progressive Attention Networks☆12Oct 25, 2016Updated 9 years ago
- A tool to download and format PASCAL VOC 2007 dataset for multilabel classification☆10Jul 17, 2017Updated 8 years ago