cyk1337/Highway-Transformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cyk1337/Highway-Transformer)

cyk1337 / Highway-Transformer

[ACL‘20] Highway Transformer: A Gated Transformer.

☆33

Alternatives and similar repositories for Highway-Transformer

Users that are interested in Highway-Transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
Noahs-ARK / PaLM
View on GitHub
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Jan 7, 2020Updated 6 years ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
siyuanseever / llama2Rnn.c
View on GitHub
☆13Apr 15, 2024Updated 2 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LZhengisme / CODA
View on GitHub
Implementation of Cascaded Head-colliding Attention (ACL'2021)
☆11Sep 16, 2021Updated 4 years ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
rycolab / aflt-f2023
View on GitHub
Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)
☆10Feb 21, 2023Updated 3 years ago
JRC1995 / Continuous-RvNN
View on GitHub
Official Repository for "Modeling Hierarchical Structures with Continuous Recursive Neural Networks" (ICML 2021)
☆12Aug 18, 2021Updated 4 years ago
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
teffland / ner-expected-entity-ratio
View on GitHub
Implementation and experiments for Partially Supervised NER via Expected Entity Ratio in TACL 2022
☆14Nov 7, 2022Updated 3 years ago
LiyuanLucasLiu / Raw-to-End
View on GitHub
Raw-to-End Name Entity Recognition in Social Media
☆16Oct 16, 2019Updated 6 years ago
proger / nanokitchen
View on GitHub
Parallel Associative Scan for Language Models
☆18Jan 8, 2024Updated 2 years ago
zja-nlp / NAT_with_DAD
View on GitHub
☆10Mar 28, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
LouChao98 / nner_as_parsing
View on GitHub
☆16Mar 22, 2023Updated 3 years ago
TurkuNLP / bert-eval
View on GitHub
☆10Oct 15, 2019Updated 6 years ago
astariul / gibbs
View on GitHub
Scale your ML workers asynchronously across processes and machines
☆13Apr 1, 2025Updated last year
lucidrains / learning-to-expire-pytorch
View on GitHub
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34Oct 30, 2020Updated 5 years ago
astariul / encode-attend-navigate-pytorch
View on GitHub
Encode-attend-navigate unofficial Pytorch implementation
☆12Oct 1, 2024Updated last year
acosharma / elita-transformer
View on GitHub
Official Repository for Efficient Linear-Time Attention Transformers.
☆18Jun 2, 2024Updated 2 years ago
xuuuluuu / SynLSTM-for-NER
View on GitHub
Code and models for the paper titled "Better Feature Integration for Named Entity Recognition", NAACL 2021.
☆30Nov 5, 2021Updated 4 years ago
INK-USC / IsoBN
View on GitHub
IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization
☆12Nov 23, 2021Updated 4 years ago
maximzubkov / fft-scan
View on GitHub
Efficient PScan implementation in PyTorch
☆17Jan 2, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
yahshibu / nested-ner-tacl2020-flair
View on GitHub
Implementation of Nested Named Entity Recognition using Flair
☆24Oct 29, 2021Updated 4 years ago
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
RUCAIBox / MPOP
View on GitHub
☆13Jun 16, 2021Updated 5 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
DFKI-NLP / lrv
View on GitHub
Layerwise Relevance Visualization in Convolutional Text Graph Classifiers
☆11Jun 2, 2021Updated 5 years ago
Aleph-Alpha-Research / NeurIPS-WANT-submission-efficient-parallelization-layouts
View on GitHub
☆22Dec 15, 2023Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
whyNLP / Probabilistic-Transformer
View on GitHub
A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.
☆26Oct 22, 2023Updated 2 years ago
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
Gladys-Zhao / mRNN-mLSTM
View on GitHub
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Jan 6, 2021Updated 5 years ago
mcoavoux / mtg
View on GitHub
Statistical discontinuous constituent parsing
☆11Feb 15, 2018Updated 8 years ago
sustcsonglin / disco-pointer
View on GitHub
Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …
☆14Aug 25, 2023Updated 2 years ago
robert-lieck / RBN
View on GitHub
Recursive Bayesian Networks
☆11May 11, 2025Updated last year