Posts by William Hill
Generative AI / LLMs
Sep 19, 2024
Just Released: Torch-TensorRT v2.4.0
Includes C++ runtime support in Windows Support, Enhanced Dynamic Shape support in Converters, PyTorch 2.4, CUDA 12.4, TensorRT 10.1, Python 3.12.
1 MIN READ
Generative AI / LLMs
Jul 02, 2024
Achieving High Mixtral 8x7B Performance with NVIDIA H100 Tensor Core GPUs and NVIDIA TensorRT-LLM
As large language models (LLMs) continue to grow in size and complexity, the performance requirements for serving them quickly and cost-effectively continue to...
9 MIN READ
Generative AI / LLMs
May 20, 2024
Supercharge Generative AI Development with Firebase Genkit, Optimized by NVIDIA RTX GPUs
At Google I/O 2024, Google announced Firebase Genkit, a new open-source framework for developers to add generative AI to web and mobile applications using...
4 MIN READ
Generative AI / LLMs
May 14, 2024
NVIDIA TensorRT 10.0 Upgrades Usability, Performance, and AI Model Support
NVIDIA today announced the latest release of NVIDIA TensorRT, an ecosystem of APIs for high-performance deep learning inference. TensorRT includes inference...
7 MIN READ