titu1994 / tf-sha-rnnLinks
Tensorflow port implementation of Single Headed Attention RNN
☆16Updated 5 years ago
Alternatives and similar repositories for tf-sha-rnn
Users that are interested in tf-sha-rnn are comparing it to the libraries listed below
Sorting:
- Large Scale BERT Distillation☆33Updated 2 years ago
- SNAIL Attention Block for Keras.☆16Updated 5 years ago
- Interpretable Models for NLP using PyTorch☆18Updated 7 years ago
- ☆12Updated 6 years ago
- Implementing activation functions from scratch in Tensorflow.☆36Updated 3 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 5 years ago
- A supplementary code for Beyond Vector Spaces: Compact Data Representation as Differentiable Weighted Graphs.☆47Updated 5 years ago
- hierarchical convolutional attention networks for text classification☆16Updated 5 years ago
- Tensorflow NCE loss in Keras☆34Updated 6 years ago
- Code for NeurIPS 2019 paper "Hierarchical Optimal Transport for Document Representation"☆54Updated 5 years ago
- Model for learning document embeddings along with their uncertainties☆35Updated last year
- SCoPE: Sentence Content Paragraph Embeddings☆18Updated 5 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆14Updated 5 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 3 years ago
- Python package for graph statistics☆9Updated 4 years ago
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- ☆15Updated 3 years ago
- ECLARE: Extreme Classification with Label Graph Correlations☆42Updated 3 years ago
- Density Order Embeddings☆33Updated 6 years ago
- ☆15Updated 4 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆32Updated 2 years ago
- Code for our ACL '20 paper "Representation Engineering with Natural Language Explanations"☆29Updated 5 years ago
- ☆26Updated 5 years ago
- Pytorch implementation of Dauphin et al. (2016) "Language Modeling with Gated Convolutional Networks"☆29Updated 2 years ago
- Multitask Learning with Pretrained Transformers☆40Updated 4 years ago
- Minimalistic TensorFlow2+ deep metric/similarity learning library with loss functions, miners, and utils as embedding projector.☆38Updated 2 years ago
- Hyperparameter search for AllenNLP - powered by Ray TUNE☆28Updated 4 months ago
- Source code for "A Lightweight Recurrent Network for Sequence Modeling"☆26Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- ☆66Updated 2 years ago