Transformer Deep Learning Architecture Overview

 

This video is an introductory overview of the transformer deep learning architecture.  It is the first introductory lecture in the Stanford CS25 seminar on transformers.  It includes an overview of transformers, attention mechanisms, self attention, encoder-decoder architectures, and applications of transformers.


Comments

Popular posts from this blog

CycleGAN: a GAN architecture for learning unpaired image to image transformations

Pix2Pix: a GAN architecture for image to image transformation

Smart Fabrics