이 글에서는 트렌스포머 모델의 핵심 메커니즘인 self-Attention 메커니즘에 대해 설명하겠다.https://arxiv.org/abs/1706.03762 Attention Is All You NeedThe dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a newarxiv.org Transformer Model ..