The Idea of Attention
Let the model focus on what matters.
Why Attention?
When you read a sentence, you do not weigh every word equally. Attention gives a model that same power to focus on what matters. 🎯
The Bottleneck Problem
Older models squeezed a whole sentence into one fixed vector. That bottleneck lost detail, especially for long inputs that carry many ideas.