Attention is All You Need (2017)

Attention Is All You Need

‘트랜스포머’의 저자들은 요즘 무엇을 하고 있을까?

Transformer (2017. 6. 12.) 이전에 Alec Radford의 Unsupervised sentiment neuron (2017. 4. 6.)

Unsupervised sentiment neuron

Alec Radford는 누구인가?

Untitled

GPT-1 (2018): https://paperswithcode.com/paper/improving-language-understanding-by

Untitled

GPT-2 (2019): https://paperswithcode.com/paper/language-models-are-unsupervised-multitask

3B1B의 자세한 GPT, Transformer 풀이

But what is a GPT? Visual intro to transformers | Chapter 5, Deep Learning

Attention in transformers, visually explained | Chapter 6, Deep Learning

7편을 기다리고 있음

2022년에 무척 도움이 됐던,

The GPT-3 Architecture, on a Napkin