Posts by Jeremy Appleyard
Simulation / Modeling / Design
Oct 17, 2017
Programming Tensor Cores in CUDA 9
[caption id="attachment_7875" align="alignright" width="200"] Tensor cores provide a huge boost to convolutions and matrix operations. Tensor cores are...
16 MIN READ
Data Science
Apr 06, 2016
Optimizing Recurrent Neural Networks in cuDNN 5
[caption id="attachment_6585" align="alignright" width="300"] Figure 1: cuDNN 5 + Torch speedup vs. Torch-rnn implementation, M40, Intel® Xeon® Processor...
10 MIN READ
Simulation / Modeling / Design
Aug 07, 2014
CUDA Pro Tip: Optimize for Pointer Aliasing
Often cited as the main reason that naïve C/C++ code cannot match FORTRAN performance, pointer aliasing is an important topic to understand when considering...
6 MIN READ