Efficient Self-Attention for Transformers

The memory and computational demands of the original attention mechanism grow quadratically with sequence length, making it impractical for long sequences. To address this, a variety of methods have been developed to reduce the attention mechanism's complexity. In this video, we explore some of the most prominent models that tackle this challenge.
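For reference, here is a minimal NumPy sketch of the standard scaled dot-product attention these efficient methods improve on; the n × n score matrix it materializes is the source of the quadratic cost. The function name and shapes are illustrative, not from the video.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    # Q, K, V: (n, d) arrays for sequence length n and head dimension d.
    d = Q.shape[-1]
    # This (n, n) score matrix is what makes vanilla attention
    # quadratic in both memory and compute as n grows.
    scores = Q @ K.T / np.sqrt(d)
    # Row-wise softmax over the scores.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V

# Doubling n quadruples the score matrix:
n, d = 1024, 64
Q = K = V = np.random.randn(n, d)
out = scaled_dot_product_attention(Q, K, V)  # materializes a 1024 x 1024 matrix
```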
#transformers
Link to the activation function video:
A Review of 10 Most Popular Activation Functions in Neural Networks