Cranial-XIX / longhorn
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆40 Updated 3 months ago
Related projects
Alternatives and complementary repositories for longhorn
- ☆46 Updated 5 months ago
- ☆45 Updated 9 months ago
- ☆25 Updated 3 weeks ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆24 Updated 7 months ago
- Stick-breaking attention ☆34 Updated last week
- [NeurIPS 2023 spotlight] Official implementation of HGRN in our NeurIPS 2023 paper - Hierarchically Gated Recurrent Neural Network for Se… ☆61 Updated 6 months ago
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models ☆57 Updated 3 weeks ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action ☆35 Updated last year
- ☆29 Updated 2 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ☆38 Updated 10 months ago
- Minimal but scalable implementation of large language models in JAX ☆26 Updated 2 weeks ago
- ☆69 Updated 8 months ago
- ☆50 Updated 6 months ago
- ☆25 Updated 3 weeks ago
- ☆28 Updated 7 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆39 Updated 3 months ago
- Official code for the paper "Attention as a Hypernetwork" ☆23 Updated 4 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 2 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆80 Updated last week
- ☆73 Updated 4 months ago
- ☆68 Updated 2 months ago
- [CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models" ☆15 Updated 3 weeks ago
- ☆45 Updated 4 months ago
- ☆24 Updated 8 months ago
- HGRN2: Gated Linear RNNs with State Expansion ☆49 Updated 3 months ago
- ☆22 Updated 4 months ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 Updated last month
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ) ☆19 Updated 5 months ago
- ☆44 Updated last year
- ☆75 Updated last year