kyegomez / SimplifiedTransformers

SimplifiedTransformer simplifies the transformer block without affecting training: it removes skip connections, projection parameters, sequential sub-blocks, and normalization layers. Experimental results confirm similar training speed and performance.
14 · Updated 2 weeks ago
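
To make the description above concrete, here is a minimal NumPy sketch of a block with those four pieces removed. It is not the repository's code or API: the function name `simplified_block`, the weight names, and the shapes are all hypothetical. It also assumes that "projection parameters" means the value and output projections of attention, and that removing "sequential sub-blocks" means running attention and the MLP in parallel on the same input. Attention here is single-head and unmasked for brevity.

```python
import numpy as np

def softmax(scores):
    # Numerically stable softmax over the last axis.
    shifted = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(shifted)
    return weights / weights.sum(axis=-1, keepdims=True)

def simplified_block(x, w_q, w_k, w_in, w_out):
    """Hypothetical simplified block for x of shape (seq_len, d_model)."""
    d_model = x.shape[-1]
    # Attention keeps only query and key projections. The values are x itself
    # and there is no output projection (assumed reading of the description).
    scores = (x @ w_q) @ (x @ w_k).T / np.sqrt(d_model)
    attn_out = softmax(scores) @ x
    # The MLP reads the same input as attention (parallel, not sequential).
    mlp_out = np.maximum(x @ w_in, 0.0) @ w_out
    # No skip connection (x is not added back) and no normalization layer.
    return attn_out + mlp_out

# Illustrative sizes and random weights, chosen only for this example.
rng = np.random.default_rng(0)
seq_len, d_model, d_hidden = 4, 8, 16
x = rng.normal(size=(seq_len, d_model))
w_q = rng.normal(size=(d_model, d_model)) * 0.1
w_k = rng.normal(size=(d_model, d_model)) * 0.1
w_in = rng.normal(size=(d_model, d_hidden)) * 0.1
w_out = rng.normal(size=(d_hidden, d_model)) * 0.1

y = simplified_block(x, w_q, w_k, w_in, w_out)
print(y.shape)
```

The output has the same shape as the input, so blocks of this form can be stacked. The sketch shows only the structure; any initialization or attention reshaping needed to keep such a block trainable is outside its scope.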

Alternatives and similar repositories for SimplifiedTransformers:

Users who are interested in SimplifiedTransformers compare it to the libraries listed below.