2024
7
21
Running LLMs on cpus by Rust
Attention
MLP
22
Reproducing GPT-2
Reproducing GPT-2 by C
2025
3
16
How to reset a git repository to the initial state without deleting data
Git submodule
Basic concepts
Mathematical Expression
Examples
References
Attention
2024/07/21
Basic concepts
input sequenceに対し、各inputベクトルへの適切な重み付き和を学習
Q, K, Vなる3つの行列を学習、inputからquery、key、valueを計算し上記の重みつき和を計算
self-attentionの場合、Q、K、Vに同一の行列を使用
Mathematical Expression
scaled dot-product attention
Examples
References
Attention is All You Need
Copyright © 2025 All rights reserved.