Transformer Deep Learning Architecture Overview

 

This video is an introductory overview of the transformer deep learning architecture.  It is the first introductory lecture in the Stanford CS25 seminar on transformers.  It includes an overview of transformers, attention mechanisms, self attention, encoder-decoder architectures, and applications of transformers.


Comments