kyegomez / Blockwise-Parallel-Transformer

32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.
45Updated last year

Alternatives and similar repositories for Blockwise-Parallel-Transformer:

Users that are interested in Blockwise-Parallel-Transformer are comparing it to the libraries listed below