Runtime

Run the inference with input data on the provided loadable.

  1. Insert the previous NvSciFence.

    Insert the previous NvSciFence to the instance. The operation is blocked until the expiration of the previous NvSciFence. The related API is NvMediaDlaInsertPreNvSciSyncFence.

  2. Submit.

    Submit a task with specified inputs and outputs to the hardware engine with NvMediaDlaSubmit. To submit a task and skip execution on the DLA hardware, use NvMediaDlaSubmitBypass.

    These are non-blocking calls. Applications can choose to block and wait for an operation on a particular buffer to complete. To do so, use the following NvMediaTensorGetStatus function, as applicable.

  3. Get NvSciFence.

    Get the end-of-frame (EOF) or start-of-frame (SOF) from the instance. The expiration of EOF indicates the completion of the operation and the expiration of SOF indicates the start of the operation. The related APIs are NvMediaDlaGetEOFNvSciSyncFence and NvMediaDlaGetSOFNvSciSyncFence.