CUDA on Turing Opens New GPU Compute Possibilities

Accelerated Computing, CUDA, Game Development, RTX 2070, trie, Turing, Volta

Nadeem Mohammad, posted Nov 07 2018

The Turing architecture introduces so many cool new features that it’s easy to miss the quiet revolution in GPU programming that it also represents: all of the features introduced with Volta now exist in a GeForce product.

Introducing Apex: PyTorch Extension with Tools to Realize the Power of Tensor Cores

Artificial Intelligence, Features, Computer Vision & Machine Vision, cuDNN, Development Tools & Libraries, Image Processing, Image Recognition, Machine Learning & Artificial Intelligence, Tesla, Volta

Nadeem Mohammad, posted Jun 19 2018

Today at the Computer Vision and Pattern Recognition Conference in Salt Lake City, Utah, NVIDIA is kicking off the conference by demonstrating an early release of Apex, an open-source PyTorch extension that helps users maximize deep learning train

A Trio of New Nsight Tools That Empower Developers to Fully Optimize their CPU and GPU Performance

Game Development, CUDA, Development Tools & Libraries, Games, Volta

Nadeem Mohammad, posted May 31 2018

Three big NVIDIA Nsight releases on the same day! Nsight Systems is a brand-new optimization tool; Nsight Visual Studio Edition 5.6 extends support to Volta GPUs and Win10 RS4; and Nsight Graphics 1.2 replaces the current Linux Graphics Debugger.

OpenSeq2Seq: New Toolkit for Distributed and Mixed-Precision Training of Sequence-to-Sequence Models

Artificial Intelligence, Machine Learning & Artificial Intelligence, Volta

Nadeem Mohammad, posted Apr 21 2018

Researchers at NVIDIA open-sourced v0.2 of OpenSeq2Seq – a new toolkit built on top of TensorFlow for training sequence-to-sequence models.

Microsoft Releases New Version of High-Performance, Open-Source, Deep Learning Toolkit

News, Research, Higher Education / Academia, Image Recognition, Machine Learning & Artificial Intelligence, Volta

Nadeem Mohammad, posted Jun 01 2017

Previously known as CNTK, the Microsoft Cognitive Toolkit version 2.0 allows developers to create, train, and evaluate their own neural networks that can scale across multiple GPUs and multiple machines on massive data sets.
