Computer Vision / Video Analytics

NVIDIA DeepStream 7.0 Milestone Release for Next-Gen Vision AI Development

May 14, 2024

By Carlos Garcia-Sierra and Debraj Sinha

Discuss (0)

AI-Generated Summary

Dislike

The latest release of NVIDIA DeepStream 7.0 introduces new features such as DeepStream libraries through Python APIs and the DeepStream Service Maker to simplify application development.
DeepStream Service Maker abstracts the complexities of GStreamer, enabling developers to build C++ object-oriented applications efficiently and manage them through REST APIs.
NVIDIA DeepStream 7.0 also brings enhancements to Single-View 3D Tracking, support for BEVFusion with the DeepStream 3D framework, and PipeTuner 1.0 for streamlined AI pipeline optimization.

AI-generated content may summarize information incompletely. Verify important information. Learn more

NVIDIA DeepStream is a powerful SDK that unlocks GPU-accelerated building blocks to build end-to-end vision AI pipelines. With more than 40+ plugins available off-the-shelf, you can deploy fully optimized pipelines with cutting-edge AI Inference, object tracking, and seamless integration with popular IoT message brokers such as REDIS, Kafka, and MQTT.

DeepStream offers intuitive REST APIs to control your AI pipelines whether deployed at the far edge or the cloud.

Diagram shows a DeepStream SDK optimizing video streaming pipeline. — *Figure 1. DeepStream SDK workflow*

The latest release of DeepStream 7.0 is one of our most significant releases to date, crafted to empower you with groundbreaking capabilities in the era of generative AI. This release is packed with innovative features designed to accelerate the development of next-generation applications.

Release highlights include the following:

A new development pathway using new DeepStream libraries through Python APIs
Simplified application development with the new Service Maker
Enhanced features of Single-View 3D tracker
Support for sensor fusion model BEVFusion with the DeepStream 3D framework.
Support for Windows Subsystem for Linux (WSL2)
Streamlined AI pipeline optimization with PipeTuner

Download DeepStream version 7.0 today.

DeepStream libraries: Expanding developer horizons

When building vision AI applications, the first order of business is optimizing AI pipelines for top-notch performance. Whether you’re a seasoned pro or just diving in, understanding the strategic landscape is key.

Broadly, you have two pivotal approaches to choose from:

Existing out-of-the-box frameworks such as GStreamer.
Functional APIs that accelerate key building blocks on your own framework.

Figure 3 shows the structure of a DeepStream plugin. At its core, each plugin encapsulates its essential functionality within a library, accessible through a well-defined interface that aligns with the GStreamer plugin specification.

This standardized approach ensures seamless compatibility and integration within the GStreamer ecosystem. DeepStream adds zero-memory copy between plugins, enabling a state-of-the-art performance.

With the launch of DeepStream 7.0, NVIDIA is excited to open new pathways for developers, offering the flexibility to continue harnessing the power of GStreamer or to tap into the robust capabilities of DeepStream libraries through intuitive Python APIs. This dual approach not only broadens accessibility to NVIDIA acceleration capabilities to Python developers but also integrates seamlessly into your existing AI frameworks.

DeepStream Libraries, powered by NVIDIA CV-CUDA, NvImageCodec, and PyNvVideoCodec, offer a set of low-level, GPU-accelerated operations that are easy drop-in replacements to CPU-bottlenecked equivalents in pre– and post-processing stages of vision AI pipelines.

As open-source libraries, they provide complete transparency and the tools necessary to implement zero-memory copy interactions among the libraries and with popular deep-learning frameworks. Setting up is a pip installation command, streamlining the integration process.

The two paths now supported with DeepStream 7.0 have inherent benefits and tradeoffs:

Easy to learn and integrate: DeepStream Libraries simplify the learning curve, enabling you to integrate Python APIs swiftly and witness the immediate benefits of GPU acceleration. A prime example is the integration with DeepStream Libraries codecs, where the impact of accelerated decode/encode of images or video frames is quickly evident after a few lines of code.
Ready-to-use solutions: For starting from scratch or without an existing pipeline framework, the mature DeepStream plugins combined with the GStreamer framework offer a rapid route to market deployment. These plugins come with built-in zero memory copy and sophisticated resource management, making them an ideal choice for efficient application development.

Radial diagram that includes resource management, support for zero-memory copy, off-the-shelf building blocks, speed to first result, and ease of integration with existing software. — *Figure 5. Tradeoffs between DeepStream libraries and plugins*

Looking ahead, NVIDIA plans to continuously expand the range of supported DeepStream libraries further enriching the developer experience with each new release.

For more information, see the following resources:

DeepStream Service Maker: Simplifying Application Development

We also have fantastic news for DeepStream developers who want to leverage GStreamer. For those new to GStreamer, it can pose a steep learning curve.

NVIDIA is thrilled to unveil a groundbreaking feature in the DeepStream technology suite: DeepStream Service Maker. This new addition simplifies the development process significantly by abstracting the complexities of GStreamer, enabling everyone to build C++ object-oriented applications efficiently in a heartbeat.

With DeepStream Service Maker, you can quickly construct a pipeline, integrate desired plugins, seamlessly link them, and launch applications—all within a matter of minutes. These applications can then be effortlessly packaged into containers and managed through REST APIs, offering a streamlined workflow that reduces the traditional coding effort dramatically.

Pipeline pipeline("deepstream-test1");
    pipeline.add("filesrc", "src", "location", argv[1])
        .add("h264parse", "parser")
        .add("nvv4l2decoder", "decoder")
        .add("nvstreammux", "mux", "batch-size", 1, "width", 1280, "height", 720)
        .add("nvinfer", "infer", "config-file-path", CONFIG_FILE_PATH)
        .add("nvvideoconvert", "converter")
        .add("nvdsosd", "osd")
        .add(sink, "sink")
        .link("src", "parser", "decoder")
        .link({"decoder", "mux"}, {"", "sink_%u"})
        .link("mux", "infer", "converter", "osd", "sink")
        .attach("infer", new BufferProbe("counter", new ObjectCounter))
        .attach("infer", "sample_video_probe", "my probe", "src", "font-size", 20)
        .start()
        .wait();

DeepStream Service Maker makes the development process more intuitive for those unfamiliar with GStreamer and also unlocks new capabilities for experienced developers. It fully supports custom plugins, which is crucial if you’ve invested time in creating custom solutions over the years.

By transforming complex coding requirements from hundreds of lines into just a few, DeepStream Service Maker revolutionizes how you approach and manage application development, making it easier and more accessible than ever.

DeepStream Service Maker also accelerates application development for edge environments as it is the ideal path to develop your own microservices for Metropolis Microservices for Jetson (MMJ). DeepStream Service Maker can also be easily deployed as microservices on the cloud that can be controlled via REST-APIs.

Video 1. Build, Deploy, and Control Vision AI Apps Easily

When your application is built with Service Maker, it can be easily packaged into a container and then managed and dynamically controlled through intuitive REST APIs such as stream addition and deletion, and region-of-interest (ROI) configuration.

Video 2. Controlling a Vision AI Application using REST APIs

This first release of DeepStream Service Maker supports C++. Python support will be available in a future release, broadening the tool’s accessibility and versatility.

For more information, see the DeepStream Service Maker documentation.

DeepStream Single-View 3D

The latest release of NVIDIA DeepStream introduces significant enhancements to the Single-View 3D Tracking (SV3DT) tracker. This advanced feature is designed to accurately track objects within a 3D space using just a single mono-camera, providing precise localization of objects on a 3D world ground plane.

This first version of SV3DT models pedestrians as cylinders on the ground plane. This approach ensures more accurate localization by positioning the feet at the bottom of the cylinder, offering a clearer, more defined representation of movement, and positioning regardless of the level of occlusion.

For more information, see the following resources:

Mitigating Occlusions in Visual Perception Using Single-View 3D Tracking in NVIDIA DeepStream
Single-View 3D Tracking (Alpha) in the NVIDIA DeepStream SDK Developer Guide

Support for BEVFusion with the DeepStream 3D framework

DeepStream 7.0 brings support to one of the most exciting AI models for sensor fusion: BEVFusion. DeepStream 7.0 enhances the DeepStream 3D (DS3D) framework and adds both LIDAR and radar inputs that can be fused with camera inputs. NVIDIA has a commitment to bring the next generation of environmental perception solutions.

Integration with various sensors is streamlined through a low-level library, available as source code, simplifying support for different sensor vendors.

BEVFusion’s integration into the DS3D framework offers a suite of features designed to boost functionality and ease of use:

Easy visualization: Rendering and rotation for LIDAR or 3D data on-screen, projection of LIDAR data into images, and the display of 3D bounding boxes from multiple viewpoints.
Message broker support: Default integration with message brokers for fast and efficient integration with other subsystems.
Sensor sync: Robust synchronization for multi-sensor data (including LIDAR, radar, and cameras), supporting both file inputs and livestreams. This feature accommodates varying frame rates and manages frame drops, crucial for real-world application adaptability.
Alignment filter: Data alignment from different sensors according to their intrinsic and extrinsic parameters, enabling precise customization suited to various sensor data alignments.
Custom 3D data preprocessing: Provides support for tailored preprocessing needs for LIDAR and radar data, enhancing processing accuracy and flexibility.
Generic data map management: A comprehensive array of sensor and tensor data managed through a key-value system, streamlining data oversight and manipulation.

With these features, DeepStream 7.0 with BEVFusion stands at the forefront of 3D AI development, pushing the boundaries of what’s possible with sensor fusion technology that can be deployed from the edge to the cloud.

For more information, see the DeepStream 3D Framework documentation.

Support for Windows Subsystem for Linux

DeepStream applications can now be developed right on your Windows system using the Windows Subsystem for Linux (WSL2). This update is a significant stride forward and a frequent request from customers who develop on IT-approved systems where Windows is the standard.

With the integration of DeepStream on WSL2, you can streamline your workflow on a single system, eliminating the need for remote access to Linux systems. This new capability ensures that you can use the powerful features of DeepStream without the complexity of dual-system setups, simplifying the development process and enhancing productivity.

DeepStream support for WSL2 offers the flexibility and convenience needed to develop advanced applications directly on Windows. Embrace the ease of Windows compatibility while benefiting from the robust capabilities of DeepStream.

For more information, see the WSL2 documentation.

PipeTuner 1.0: Streamlining AI pipeline optimization

PipeTuner 1.0 is a new developer tool poised to revolutionize the tuning of AI pipelines. AI services typically incorporate a wide array of parameters for inference and tracking. Finding the optimal settings to maximize accuracy for specific use cases is a complex and critical process.

Traditionally, manual tuning demands deep knowledge of each pipeline module and becomes impractical with extensive, high-dimensional parameter spaces—even with the support of datasets and ground-truth labels for accuracy analysis.

PipeTuner is designed to address these challenges head-on. PipeTuner efficiently explores the parameter space and automates the process of identifying the best parameters, achieving the highest possible key performance indicators (KPIs) based on the dataset provided by the user. Crucially, PipeTuner simplifies this process so that technical knowledge of the pipeline and its parameters is not required by the user.

PipeTuner 1.0 is in Developer Preview.

By integrating PipeTuner, you can accelerate your time to market and tailor DeepStream pipeline parameters for each deployment location, ensuring optimal performance in every scenario. This marks a significant step forward in making sophisticated AI pipeline optimization accessible and effective for everyone across all use cases and scenarios.

Start PipeTuning now!

Video 3. Automatically Optimize DeepStream Vision AI Apps with PipeTuner

For more information, see the PipeTuner documentation.

Summary

We’re excited to see how you use these new tools and capabilities available on the latest DeepStream SDK 7.0 release to create something extraordinary.

Get started now and dive deeper in the DeepStream forum.

As always, happy DeepStreaming!

Discuss (0)

About the Authors

About Carlos Garcia-Sierra
Carlos is a product manager at NVIDIA, focusing on making AI accessible to all. Carlos manages the DeepStream SDK, enabling new uses across a diverse set of markets. He's held multiple roles at NVIDIA, including strategic partnerships for the Metropolis business unit, business development for the Automotive BU, and technical marketing. In his free time, Carlos enjoys hiking with his family, mountain biking, reading, and growing organic food.

View all posts by Carlos Garcia-Sierra

About Debraj Sinha
Debraj Sinha is a Product Marketing Manager for Metropolis at NVIDIA, focusing on building smarter spaces around the world with AI-enabled video analytics. Debraj collaborates with partners ranging from startups to Fortune 500 companies to market AI applications that drive safety and efficiency gains. He holds an MBA degree from Haas School of Business, University of California, Berkeley and a Master's degree in Computer Science from Cornell University.

View all posts by Debraj Sinha