Attention Is All You Need
The 2017 paper that killed sequential AI and sparked the LLM era.
In 2017, Vaswani et al. published "Attention Is All You Need" — replacing recurrence with a parallel architecture that could finally scale across GPU farms. This bundle unpacks how Transformers work, why the "bank" disambiguation example matters, and why this single paper is responsible for GPT, Claude, and every modern LLM.