dashstander/block-recurrent-transformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dashstander/block-recurrent-transformer)

dashstander / block-recurrent-transformer

Pytorch implementation of "Block Recurrent Transformers" (Hutchins & Schlag et al., 2022)

☆85

Alternatives and similar repositories for block-recurrent-transformer

Users that are interested in block-recurrent-transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / block-recurrent-transformer-pytorch
View on GitHub
Implementation of Block Recurrent Transformer - Pytorch
☆226Aug 20, 2024Updated last year
google-research / meliad
View on GitHub
☆260Jun 6, 2025Updated last year
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
pittisl / mPnP-LLM
View on GitHub
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
☆13Jan 19, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
IBM / transformers-struct-guidance
View on GitHub
Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"
☆15Sep 17, 2025Updated 10 months ago
jiaqima / CopulaGNN
View on GitHub
CopulaGNN: Towards Integrating Representational and Correlational Roles of Graphs in Graph Neural Networks (ICLR 2021)
☆14Dec 5, 2022Updated 3 years ago
acosharma / elita-transformer
View on GitHub
Official Repository for Efficient Linear-Time Attention Transformers.
☆18Jun 2, 2024Updated 2 years ago
annosubmission / GRC-Cache
View on GitHub
☆16Mar 13, 2023Updated 3 years ago
Timothyxxx / NeuralSymbolicPapers
View on GitHub
☆14Aug 18, 2022Updated 3 years ago
AustenZhu / Deep-Reinforcement-Learning-in-Zipline
View on GitHub
Creating DRL infrastructure for Dynamic Beta with Zipline and Keras
☆14Dec 8, 2022Updated 3 years ago
crossLi / Ultra_light_OCR_No.9
View on GitHub
☆12Jul 8, 2021Updated 5 years ago
boschresearch / Continuous-Recurrent-Units
View on GitHub
☆69Aug 3, 2023Updated 2 years ago
LaurenceA / bayesfunc
View on GitHub
☆14Sep 11, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
NKU-IIPLab / SDRN
View on GitHub
Source code of the paper "Synchronous Double-channel Recurrent Network for Aspect-Opinion Pair Extraction, ACL 2020."
☆12Aug 10, 2020Updated 5 years ago
silverriver / OOD4NLU
View on GitHub
Code for paper "Out-of-domain detection for natural language understanding in dialog systems"
☆10May 27, 2022Updated 4 years ago
google / vsf-time-series
View on GitHub
☆31Jun 17, 2024Updated 2 years ago
yuj-umd / graphRNN
View on GitHub
Codes for the paper "Learning Graph-Level Representations with Gated Recurrent Neural Networks"
☆29Feb 11, 2019Updated 7 years ago
iantbutler01 / ditty
View on GitHub
A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.
☆16Jun 10, 2026Updated last month
PiotrNawrot / dynamic-pooling
View on GitHub
Efficient Transformers with Dynamic Token Pooling
☆68May 20, 2023Updated 3 years ago
chenchen1104 / MERA
View on GitHub
☆21Sep 6, 2025Updated 10 months ago
lucidrains / gated-state-spaces-pytorch
View on GitHub
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆101Feb 25, 2023Updated 3 years ago
AntNLP / nope_head_scale
View on GitHub
☆29May 4, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luciferkonn / DT_Mem
View on GitHub
Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"
☆23Jul 12, 2024Updated 2 years ago
neulab / cmu-multinlp
View on GitHub
Generalizing Natural Language Analysis through Span-relation Representations
☆91Sep 22, 2025Updated 10 months ago
parallelopipedon / pair-trading-envs
View on GitHub
Cryptocurrency Trading with Reinforcement Learning based on Backtrader
☆47Jan 9, 2025Updated last year
idealab-isu / DSA
View on GitHub
Differentiable Spline Approximations
☆14Sep 3, 2023Updated 2 years ago
Information-Fusion-Lab-Umass / ClinicalNotes_TimeSeries
View on GitHub
The repository for the paper "Predicting in-hospital mortality by combining clinical notes with time-series data"
☆12May 23, 2021Updated 5 years ago
ucinlp / null-prompts
View on GitHub
Code for: "Cutting Down on Prompts and Parameters: Simple Few-Shot Learning with Language Models"
☆19Feb 2, 2022Updated 4 years ago
idanshen / Value-Augmented-Sampling
View on GitHub
☆20May 16, 2024Updated 2 years ago
vanoracai / CoRE
View on GitHub
☆15Jul 14, 2022Updated 4 years ago
Adenialzz / Hello-AIDeployment
View on GitHub
A set of "Hello World" projects of AI deploy frameworks.
☆12Jun 24, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
BackupGithub-AI / LAH
View on GitHub
☆10Mar 28, 2023Updated 3 years ago
fehiepsi / web-traffic-time-series-forecasting
View on GitHub
kaggle competition: https://www.kaggle.com/c/web-traffic-time-series-forecasting
☆16Sep 12, 2017Updated 8 years ago
zhangsunny / GAN-for-Time-Series-in-Pytorch
View on GitHub
使用GAN对时间序列进行建模
☆49Jun 30, 2019Updated 7 years ago
harvardnlp / compound-pcfg
View on GitHub
☆133Sep 17, 2023Updated 2 years ago
RULprediction / RUL-prediction
View on GitHub
Prediction of Remaining Useful Life(RUL) for Aircraft Engine Using Neural Network Models
☆13Dec 7, 2018Updated 7 years ago
anvinhnguyendinh / InferencePGMbyGNN
View on GitHub
A Tensorflow implementation of the paper https://arxiv.org/pdf/1803.07710.pdf
☆14Jun 19, 2019Updated 7 years ago
gao-lab / vConv
View on GitHub
An implementation of vConv layer.
☆11Apr 28, 2021Updated 5 years ago