end-to-end AI

Idealized photo of solar panels and wind turbines in the sunshine, with a city in the background.

May 23, 2023

Strategies for Maximizing Data Center Energy Efficiency

Data centers are an essential part of a modern enterprise, but they come with a hefty energy cost. To complicate matters, energy costs are rising and the need...

8 MIN READ

Apr 27, 2023

End-to-End AI for NVIDIA-Based PCs: Optimizing AI by Transitioning from FP32 to FP16

This post is part of a series about optimizing end-to-end AI. The performance of AI models is heavily influenced by the precision of the computational...

4 MIN READ

Apr 25, 2023

End-to-End AI for NVIDIA-Based PCs: ONNX and DirectML

This post is part of a series about optimizing end-to-end AI. While NVIDIA hardware can process the individual operations that constitute a neural network...

14 MIN READ

Featured image of computer screens in stylized design.

Mar 15, 2023

End-to-End AI for NVIDIA-Based PCs: NVIDIA TensorRT Deployment

This post is the fifth in a series about optimizing end-to-end AI. NVIDIA TensorRT is a solution for speed-of-light inference deployment on NVIDIA hardware....

10 MIN READ

Feb 08, 2023

End-to-End AI for NVIDIA-Based PCs: CUDA and TensorRT Execution Providers in ONNX Runtime

This post is the fourth in a series about optimizing end-to-end AI. As explained in the previous post in the End-to-End AI for NVIDIA-Based PCs series, there...

9 MIN READ

Dec 15, 2022

End-to-End AI for NVIDIA-Based PCs: ONNX Runtime and Optimization

This post is the third in a series about optimizing end-to-end AI. When your model has been converted to the ONNX format, there are several ways to deploy it,...

8 MIN READ

Dec 15, 2022

End-to-End AI for NVIDIA-Based PCs: Transitioning AI Models with ONNX

This post is the second in a series about optimizing end-to-end AI. In this post, I discuss how to use ONNX to transition your AI models from research to...

7 MIN READ

Dec 15, 2022

End-to-End AI for NVIDIA-Based PCs: An Introduction to Optimization

This post is the first in a series about optimizing end-to-end AI. The great thing about the GPU is that it offers tremendous parallelism; it allows you to...

9 MIN READ