Attention, please!

Demystifying the math behind the attention mechanism and the transformer model

Learning the hard way

Reinforcement Learning Saga — Part I: From zero to Q-learning