The final diagram [1] is also much less "readable" than the original figures from the paper.
[1] https://dugas.ch/artificial_curiosity/img/GPT_architecture/f...