kyegomez / Blockwise-Parallel-TransformerView on GitHub
32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.
50Jun 16, 2023Updated 2 years ago

Alternatives and similar repositories for Blockwise-Parallel-Transformer

Users that are interested in Blockwise-Parallel-Transformer are comparing it to the libraries listed below

Sorting:

Are these results useful?