PC GPU Performance Hot Spots

GameWorks, GameWorks Expert Developer

Matthijs De Smedt, posted Aug 10 2016

Introduction

Game developers shipping titles for PC and consoles are faced with many interesting rendering optimization challenges. The GPU cost of techniques like tile-based lighting or the CPU cost of the rendering API are going to impact performance very differently based on which platform is being used.

Read more

Reading Between The Threads: Shader Intrinsics

GameWorks, GameWorks Expert Developer, DX11, DX12, Vulkan, OpenGL, nvapi, DesignWorks

Mathias Schott, posted Jul 29 2016

Introduction

When writing compute shaders, it’s often necessary to communicate values between threads. This is typically done via shared memory. Kepler GPUs introduced “shuffle” intrinsics, which allow threads of a warp to directly read each other's registers avoiding memory access and synchronization. Shared memory is relatively fast but instructions that operate without using memory of any kind are significantly faster still.

Read more