KrisKorrel/sparsemax-pytorch

Implementation of the Sparsemax activation in PyTorch

☆166

Alternatives and similar repositories for sparsemax-pytorch

Users who are interested in sparsemax-pytorch are comparing it to the libraries listed below.

msobroza / SparsemaxPytorch
SparseMax activation function implementation (ICML 2016) (PyTorch)
☆28 · Nov 30, 2017 · Updated 8 years ago
vene / sparse-structured-attention
Sparse and structured neural attention mechanisms
☆224 · Aug 31, 2020 · Updated 5 years ago
gokceneraslan / SparseMax.torch
Implementation attempt of "From Softmax to Sparsemax: A Sparse Model of Attention and Multi-Label Classification"
☆54 · Dec 17, 2016 · Updated 9 years ago
deep-spin / entmax
The entmax mapping and its loss, a family of sparse softmax alternatives.
☆475 · Jun 22, 2024 · Updated 2 years ago
deep-spin / sparse-marginalization-lvm
Official PyTorch (Lightning) implementation of the NeurIPS 2020 paper "Efficient Marginalization of Discrete and Structured Latent Variab…
☆27 · May 3, 2021 · Updated 5 years ago
Noahs-ARK / PaLM
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10 · Jan 7, 2020 · Updated 6 years ago
YosukeHiguchi / espnet
End-to-End Speech Processing Toolkit
☆16 · Jan 20, 2025 · Updated last year
libowen2121 / VI-dependency-syntax
Dependency Grammar Induction
☆18 · Feb 11, 2019 · Updated 7 years ago
HEmile / storchastic
Stochastic Automatic Differentiation library for PyTorch.
☆210 · Aug 30, 2024 · Updated last year
zhaoyanpeng / xcfg
X (weighted / probabilistic) Context-Free Grammars
☆25 · Jan 30, 2024 · Updated 2 years ago
Lingkai-Kong / so-ebm
Code for paper: End-to-end Stochastic Optimization with Energy-based Model
☆16 · Feb 14, 2023 · Updated 3 years ago
js-lee-AI / Awesome_SimultaneousTranslation
Awesome papers and codes for Simultaneous Machine Translation
☆15 · Dec 6, 2021 · Updated 4 years ago
timvieira / vocrf
Variable-order CRFs with structure learning
☆17 · Aug 1, 2024 · Updated last year
hkjeon13 / noising-korean
Adds noise to Korean documents.
☆27 · Nov 9, 2022 · Updated 3 years ago
vene / sparsemap
SparseMAP: differentiable sparse structure inference
☆112 · Feb 10, 2019 · Updated 7 years ago
SimengSun / ChapterBreak
☆12 · Jun 5, 2024 · Updated 2 years ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆33 · May 25, 2024 · Updated 2 years ago
yikangshen / Ordered-Memory
This repository contains the code used for the Ordered Memory paper
☆29 · Jan 12, 2020 · Updated 6 years ago
vy007vikas / PyTorch-PtrNet
PyTorch implementation of PtrNet to solve the sorting problem.
☆12 · Dec 19, 2017 · Updated 8 years ago
jxhe / struct-learning-with-flow
PyTorch Implementation of "Unsupervised Learning of Syntactic Structure with Invertible Neural Projections" (EMNLP 2018)
☆67 · Feb 19, 2020 · Updated 6 years ago
ghazi-f / QKVAE
Implementation of QKVAE
☆11 · Feb 24, 2023 · Updated 3 years ago
nec-research / tf-imle
TensorFlow implementation and notebooks for Implicit Maximum Likelihood Estimation
☆68 · Apr 1, 2022 · Updated 4 years ago
nec-research / st_tau
This repository contains code for the paper "Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs" (Wang, Lawrence…
☆17 · Mar 8, 2021 · Updated 5 years ago
FranxYao / RDP
Implementation of ICML 22 Paper: Scaling Structured Inference with Randomization
☆13 · Jul 24, 2022 · Updated 3 years ago
ritheshkumar95 / pytorch-normalizing-flows
☆17 · May 28, 2018 · Updated 8 years ago
teffland / ner-expected-entity-ratio
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14 · Nov 7, 2022 · Updated 3 years ago
bhattbhavesh91 / few-shot-learning-using-gpt-neo
Few-shot learning using EleutherAI's GPT-Neo, an open-source version of GPT-3
☆19 · Jul 8, 2021 · Updated 5 years ago
forkunited / ltprg
☆13 · Jun 3, 2019 · Updated 7 years ago
Beomi / transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23 · May 20, 2021 · Updated 5 years ago
i-machine-think / awesome-compositionality
A list of resources dedicated to compositionality
☆14 · Feb 21, 2019 · Updated 7 years ago
ermongroup / fast_feedforward_computation
Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021
☆30 · Sep 25, 2021 · Updated 4 years ago
tunib-ai / transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31 · Feb 5, 2022 · Updated 4 years ago
farbodtm / reptile-pytorch
PyTorch implementation of OpenAI's REPTILE Algorithm
☆26 · May 8, 2018 · Updated 8 years ago
Lyusungwon / generative_models_pytorch
Implementation of various generative models
☆14 · Oct 1, 2018 · Updated 7 years ago
AlirezaMorsali / MLP-Attention
☆17 · Dec 19, 2024 · Updated last year
JiaxunCai / Dynet-Biaffine-SRL
☆11 · Aug 14, 2018 · Updated 7 years ago
zengyan-97 / Transformer-DST
A Generative Dialogue State Tracking Model
☆23 · Jun 24, 2021 · Updated 5 years ago
facebookresearch / higher
higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr…
☆1,629 · Mar 25, 2022 · Updated 4 years ago
mstrise / dep2label-up
Dependency Parsing as Sequence Labeling with Python3+ and PyTorch1+ and MTL
☆10 · Nov 21, 2019 · Updated 6 years ago