HazyResearch / safari
Convolutions for Sequence Modeling
☆908 Updated last year
Alternatives and similar repositories for safari
Users who are interested in safari are comparing it to the libraries listed below.
- Language Modeling with the H3 State Space Model ☆521 Updated 2 years ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆980 Updated last year
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …