Attention is All You Need (2017)
Transformer (2017. 6. 12.) 이전에 Alec Radford의 Unsupervised sentiment neuron (2017. 4. 6.)

GPT-1 (2018): https://paperswithcode.com/paper/improving-language-understanding-by

GPT-2 (2019): https://paperswithcode.com/paper/language-models-are-unsupervised-multitask
But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning
Attention in transformers, visually explained | Chapter 6, Deep Learning
7편을 기다리고 있음
2022년에 무척 도움이 됐던,