Cite
Notes
Only stored in your browser.
Attribution
TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
arXiv 2024
Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers
arXiv 2023
from 2 papers
Andreas Dengel
Tobias Christian Nauen
Federico Raue