Build your high-performance model inference solution with DJL and ONNX Runtime

YouTube
CONFLUENCE

Description

In many companies, Java is the primary language for the teams to build up services. To have ONNX model onboard and integration, developers faced several technical challenges on the resource allocation and performance tuning. In this talk, we will walk you through the inference solution built by DJL, a ML library in Java. In the meantime, we will share some customer success stories with model hosting using ONNXRuntime and DJL.

OnnxVideo

Build your high-performance model inference solution with DJL and ONNX Runtime

Description

Details