Enhanced Image Analysis with Multidimensional Image Processing

Jul 27, 2022

By Michael Boone, Gigon Bae and Gregory Lee

Image data can generally be described through two dimensions (rows and columns), with a possible additional dimension for the colors red, green, blue (RGB). However, sometimes further dimensions are required for more accurate and detailed image analysis in specific applications and domains.

For example, you may want to study a three-dimensional (3D) volume, measuring the distance between two parts or modeling how that 3D volume changes over time (the fourth dimension). In these instances, you need more than two dimensions to make sense of what you are seeing.
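As a minimal sketch of measuring distance inside a 3D volume, the snippet below converts two voxel positions into a physical distance. The (z, y, x) axis order, the voxel spacing, and the two points are hypothetical values chosen for illustration, not data from any particular scanner.

```python
import math


def physical_distance(p, q, spacing):
    """Euclidean distance between voxel indices p and q, scaled per axis by voxel spacing."""
    if not (len(p) == len(q) == len(spacing)):
        raise ValueError("p, q, and spacing must have the same number of axes")
    return math.sqrt(sum(((a - b) * s) ** 2 for a, b, s in zip(p, q, spacing)))


# Hypothetical volume: slices are 2.0 mm apart, in-plane pixels are 0.5 mm wide.
spacing_mm = (2.0, 0.5, 0.5)   # (z, y, x)
point_a = (10, 100, 100)       # voxel index of the first landmark
point_b = (13, 116, 100)       # voxel index of the second landmark

distance_mm = physical_distance(point_a, point_b, spacing_mm)
print(f"{distance_mm:.1f} mm")  # 10.0 mm
```

Because the spacing differs per axis, the same calculation in raw index units would give a misleading result. This is one reason volumetric data usually carries spacing metadata alongside the pixel values.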

Greater image understanding requires enhanced capability

Multidimensional image processing, or n-dimensional image processing, is the broad term for analyzing, extracting, and enhancing useful information from image data with two or more dimensions. It is particularly helpful and needed for medical imaging, remote sensing, material science, and microscopy applications.

Some methods in these applications may involve data from more channels than traditional grayscale, RGB, or red, green, blue, alpha (RGBA) images. N-dimensional image processing helps you study and make informed decisions using devices enabled with identification, filtering, and segmentation capabilities.

Multidimensional image processing gives you the flexibility to apply the functions traditionally used for two-dimensional filtering in scientific applications. Within medical imaging specifically, computed tomography (CT) and magnetic resonance imaging (MRI) scans require multidimensional image processing to form images of the body and its functions. For example, multidimensional image processing is used in medical imaging to detect cancer or estimate tumor size (Figure 1).

Multidimensional image processing developer challenges

Outside of identifying, acquiring, and storing the image data itself, working with multidimensional image data comes with its own set of challenges.

First, multidimensional images are larger than their 2D counterparts and typically of high resolution, so loading them into memory and accessing them is time-consuming.
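To make the size difference concrete, here is a rough back-of-the-envelope calculation. The image shapes and the 16-bit sample size are hypothetical examples, and the figures cover only raw, uncompressed pixel data.

```python
import math


def image_bytes(shape, bytes_per_sample):
    """Uncompressed size in bytes of an image with the given shape."""
    return math.prod(shape) * bytes_per_sample


MIB = 2 ** 20
GIB = 2 ** 30

# Hypothetical 16-bit acquisitions (2 bytes per sample).
slice_2d = image_bytes((2048, 2048), 2)             # one 2D image
volume_3d = image_bytes((512, 2048, 2048), 2)       # 512 slices
series_4d = image_bytes((20, 512, 2048, 2048), 2)   # 20 time points

print(f"2D slice : {slice_2d / MIB:.0f} MiB")   # 8 MiB
print(f"3D volume: {volume_3d / GIB:.0f} GiB")  # 4 GiB
print(f"4D series: {series_4d / GIB:.0f} GiB")  # 80 GiB
```

Each added axis multiplies the footprint by its length, so a dataset can outgrow host or GPU memory quickly and may need to be processed in tiles or chunks.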

Second, processing each additional dimension of image data requires additional time and processing power. Analyzing more dimensions enlarges the scope of consideration.

Third, computer-vision and image-processing algorithms take longer to analyze each additional dimension, including their low-level operations and primitives. The complexity of multidimensional filters, gradients, and histograms grows with each additional dimension.
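One way to see this growth is to count the multiply-add operations a filter needs per output sample. The sketch below assumes a dense kernel of equal width along every axis and compares it with a separable filter (such as a Gaussian) applied as one 1D pass per axis. The kernel width of 5 is an arbitrary example.

```python
def dense_taps(width, ndim):
    """Multiply-adds per output sample for a dense kernel of the given width."""
    return width ** ndim


def separable_taps(width, ndim):
    """Multiply-adds per output sample when the filter is applied as one 1D pass per axis."""
    return width * ndim


for ndim in (2, 3, 4):
    print(f"{ndim}D: dense={dense_taps(5, ndim)}, separable={separable_taps(5, ndim)}")
# 2D: dense=25, separable=10
# 3D: dense=125, separable=15
# 4D: dense=625, separable=20
```

The per-sample cost is then multiplied by the number of samples, which also grows with every added dimension.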

Finally, once the data is manipulated, dataset visualization is further complicated by the additional dimensions under consideration and the quality at which the data must be rendered. In biomedical imaging, the level of detail required can make the difference in identifying cancerous cells and damaged organ tissue.

Multidimensional input/output challenges

If you’re a data scientist or researcher working in multidimensional image processing, you need software that can make data loading and handling for large image files efficient. Popular multidimensional file formats include the following:

NumPy binary format (.npy)
Tag Image File Format (TIFF)
TFRecord (.tfrecord)
Zarr
Variants of the formats listed above
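As a small illustration of efficient handling with one of these formats, the sketch below writes a volume to a .npy file and then memory-maps it so that only the requested plane is read into memory. It uses plain NumPy on the CPU, and the tiny array and temporary file path stand in for a real dataset.

```python
import os
import tempfile

import numpy as np

# Hypothetical stand-in for a large volume: 4 planes of 8 x 8 samples.
volume = np.arange(4 * 8 * 8, dtype=np.uint16).reshape(4, 8, 8)

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "volume.npy")
    np.save(path, volume)

    # mmap_mode="r" maps the file instead of loading the whole array.
    mapped = np.load(path, mmap_mode="r")
    mapped_shape = mapped.shape

    # Copy a single plane into memory; the other planes stay on disk.
    plane = np.array(mapped[2])

    # Release the mapping before the temporary directory is removed.
    del mapped

print(mapped_shape, plane.shape)  # (4, 8, 8) (8, 8)
```

Chunked formats such as Zarr follow a similar idea at larger scale, storing the array in pieces so that a region can be read without touching the rest of the file.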

Because every pixel counts, you have to process image data accurately with all the processing power available. Graphics processing unit (GPU) hardware gives you the processing power and efficiency needed to handle and balance the workload of analyzing complex, multidimensional image data in real time.

cuCIM addresses multidimensional image processing challenges

Compute Unified Device Architecture Clara IMage (cuCIM) is an open-source, accelerated, computer-vision and image-processing software library that uses the processing power of GPUs to address the needs and pain points of developers working with multidimensional images.

Data scientists and researchers need software that is fast, easy to use, and reliable for an increasing workload. While specifically tuned for biomedical applications, cuCIM can be used for geospatial, material and life sciences, and remote sensing use cases.

cuCIM offers 200+ computer-vision and image-processing functions for color conversion, exposure, feature extraction, measuring, segmentation, restoration, and transforms.

cuCIM is capable and fast image-processing software, requiring minimal changes to your existing pipeline. cuCIM equips you with enhanced digital image-processing capabilities that can be integrated into existing pipelines:

You can integrate using either a C++ or Python application programming interface (API) that matches OpenSlide for I/O and scikit-image for processing in Python.

The cuCIM Python bindings offer many commonly used computer-vision and image-processing functions that are easy to integrate and compile into the developer workflow.

You don’t have to learn a new interface or programming language to use cuCIM. In most instances, only one line of code is added for transferring images to the GPU. The cuCIM coding structure is nearly identical to that used for the CPU, so there’s little change needed to take advantage of the GPU-enabled capabilities.

Because cuCIM is also enabled for GPUDirect Storage (GDS), you can transfer data directly between storage and the GPU without making an intermediate copy in host (CPU) memory, which saves time on I/O tasks.
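The points above can be sketched as a short pipeline: read a region through the OpenSlide-style interface, move it to the GPU, and apply a scikit-image-style filter. This is an illustrative sketch rather than an official recipe. The function name, default arguments, and the file path in the usage note are hypothetical, and the code assumes a CUDA-capable GPU, CuPy, and a cuCIM release whose `gaussian` accepts `channel_axis`. It does not use GDS.

```python
def smooth_region_on_gpu(path, location=(0, 0), size=(512, 512), level=0, sigma=2.0):
    """Read one region with cuCIM, smooth it on the GPU, and return a NumPy array.

    Hypothetical helper for illustration. Imports are local so that the module
    can still be imported on a machine without a GPU or without cuCIM installed.
    """
    import cupy as cp
    import numpy as np
    from cucim import CuImage
    from cucim.skimage.filters import gaussian  # mirrors skimage.filters.gaussian

    # OpenSlide-style I/O: open the file and read a region at a resolution level.
    image = CuImage(path)
    region = image.read_region(location=location, size=size, level=level)

    # The extra step compared with a CPU pipeline: copy the pixels to the GPU.
    gpu_region = cp.asarray(np.asarray(region))

    # Same call shape as scikit-image; the last axis is treated as color channels.
    smoothed = gaussian(gpu_region, sigma=sigma, channel_axis=-1)

    # Copy the result back to host memory for downstream CPU code.
    return cp.asnumpy(smoothed)
```

On a suitable machine, a call might look like `smooth_region_on_gpu("slide.tif", location=(1024, 1024))`, where `slide.tif` is a placeholder for your own image. Swapping the `cucim.skimage` import for `skimage` and dropping the CuPy transfers would give the CPU equivalent.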

With its quick set-up, cuCIM provides the benefit of GPU-accelerated image processing and efficient I/O with minimal developer effort and with no low-level compute unified device architecture (CUDA) programming required.

Free cuCIM downloads and resources

cuCIM can be downloaded for free through Conda or PyPI. For more information, see the cuCIM developer page. You’ll learn about developer challenges, primitives, and use cases and get links to references and resources.

About the Authors

About Michael Boone
Michael Boone is the Manager for Trustworthy AI Product at NVIDIA. He is responsible for building NVIDIA’s technology according to its guiding principles—driving the implementation of products, tools, and processes that enable the company, its customers, and the larger ecosystem to deploy AI with confidence. Beginning his career as a licensed civil engineer, Michael pivoted from transportation infrastructure project management and operations to owning NVIDIA’s global core computer vision product marketing strategy, as well as product feature definition for DRIVE AV. Michael brings a safety-first engineering mindset to the AI frontier, drawing on his background in physical infrastructure to ensure digital systems are built with the same principle and rigor. An inventor and car enthusiast, Michael is a highly trusted collaborator and a leading voice in the deployment of emerging technology across public, private, and research environments.

About Gigon Bae
Gigon Bae is a software engineer at NVIDIA, working on NVIDIA Clara, a healthcare application framework for AI-powered imaging, genomics, and for the development and deployment of smart sensors. He previously worked in the Camera team as a camera software engineer for the Android/Linux camera module on the NVIDIA Jetson/Shield platform. Before joining NVIDIA in 2015, he obtained a Ph.D. from Korea Advanced Institute of Science and Technology (KAIST) with a specialization in software testing and program analysis, with emphasis on the application of automated test case generation techniques to GUI software and unit-level code, and on empirical studies.

About Gregory Lee
Greg Lee is a software engineer on the NVIDIA Clara team. He earned a Ph.D. in biomedical engineering from the University of Michigan and worked for a number of years in magnetic resonance imaging (MRI) research. With an interest in GPU-accelerated computing, he is a co-creator of RAPIDS cuCIM and frequent contributor to CuPy. He is also involved in the development and maintenance of several open-source scientific Python libraries (PyWavelets, SciPy, scikit-image, and Zarr).
