Transformer Deep Learning Architecture Overview


This video is an introductory overview of the transformer deep learning architecture.  It is the first introductory lecture in the Stanford CS25 seminar on transformers.  It includes an overview of transformers, attention mechanisms, self attention, encoder-decoder architectures, and applications of transformers.


Popular posts from this blog

Simulating the Universe with Machine Learning

CycleGAN: a GAN architecture for learning unpaired image to image transformations

Pix2Pix: a GAN architecture for image to image transformation