Today's NVIDIA GTC 2020 Announcements

Parts 1 through 9 of the NVIDIA GTC 2020 keynote presentation are now available for your viewing pleasure. Lots of fun new things to check out from NVIDIA.
Here's Part 1 of the 9 presentations. Note that the conference is happening remotely, so Jensen Huang gives his keynote talk from his home kitchen, where he is, for some reason, wearing a black leather jacket.

We recently discussed NVIDIA's Jetson Nano single-board accelerators when we looked at embedded deployment solutions for deep learning neural nets.

There is now a new Jetson Xavier NX module that is pin-compatible with the Jetson Nano.
It includes the following features:
An integrated 384-core NVIDIA Volta GPU with 48 Tensor Cores
Two NVIDIA Deep Learning Accelerator engines
Seven-way VLIW Vision Accelerator
Six-core NVIDIA Carmel 64-bit ARMv8.2 CPU
8 GB of 128-bit LPDDR4x memory

Jetson Xavier now includes what NVIDIA calls 'cloud native technologies', such as Docker-based containerization and Kubernetes. The Jetson Xavier NX Developer Kit is available now for $399. There is a limit of 5 per customer, so you will be constrained when you try to use those Kubernetes features to build your own custom GPU cluster.

New application frameworks were also announced for 'conversational AI services' (a fancy term for AI bots, meaning AI systems that conversationally interact with people) and for deep learning recommendation systems.

CUDA Toolkit 11 was also announced. Here's a more detailed 'features revealed' post that talks about all of the new CUDA 11 features. Note the conspicuous absence of Apple from the supported platforms table.

A new expressive and natural speech synthesis model called Flowtron was also announced. Flowtron does the voice narration for the embedded commercial part of the keynote above (at the 2:29 timestamp). Here's a preprint paper that talks about the model.

Flowtron learns an invertible mapping of data to a latent space that can be manipulated to control many aspects of speech synthesis (pitch, tone, speech rate, cadence, accent). You also have control over speech variation and style transfer.
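
To get a feel for what "an invertible mapping of data to a latent space" means, here is a toy sketch in plain Python. It is not Flowtron's architecture: the `conditioner` function is a made-up stand-in for a learned network, and the data is four arbitrary numbers rather than audio features. It only shows the general idea that an affine transform whose shift and scale depend on earlier samples can be run forwards (data to latent) and backwards (latent to data) exactly, so editing the latent values changes what comes back out.

```python
import math

def conditioner(prev):
    """Hypothetical stand-in for a learned network.

    Returns a (shift, scale) pair computed from the previous data sample.
    The scale is an exponential, so it is always positive and the map
    below can always be undone.
    """
    shift = 0.5 * prev
    scale = math.exp(0.1 * math.tanh(prev))
    return shift, scale

def forward(xs):
    """Map a data sequence to its latent sequence."""
    zs, prev = [], 0.0
    for x in xs:
        shift, scale = conditioner(prev)
        zs.append((x - shift) / scale)
        prev = x  # the next step is conditioned on the data seen so far
    return zs

def inverse(zs):
    """Map a latent sequence back to data, one step at a time."""
    xs, prev = [], 0.0
    for z in zs:
        shift, scale = conditioner(prev)
        x = z * scale + shift
        xs.append(x)
        prev = x
    return xs

data = [0.2, -0.4, 1.0, 0.3]            # arbitrary toy values
latent = forward(data)
reconstructed = inverse(latent)          # recovers data up to float rounding

# Manipulating the latent: shrinking it toward zero is one simple edit,
# and decoding the edited latent gives a different output sequence.
damped = inverse([0.5 * z for z in latent])
```

Because every step is undone exactly by its inverse, nothing is lost going into the latent space, which is what makes it safe to edit there and decode again.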

