Transformers have been all the rage in the NLP community ever since GPT-3 was released and have recently become more well-known to the public after ChatGPT was released. I’m going to keep track of my favorite ways to learn about the Transformer architecture here.

Papers

Blog Posts

YouTube Videos

Courses

Detailed understanding of the problem is in the transformerdeep page.