Interoperating with Spox

Spox provides a direct, Pythonic interface to the ONNX standard and is an effective way to construct ONNX graphs. Because ndonnx is built on Spox, you can drop down to Spox’s lower-level interface whenever you need it.

When is it useful to interoperate with Spox?

You want to directly access an ONNX operator that expresses your intent. High-level operators like LSTM or TreeEnsemble are not directly accessible from ndonnx because they are not natural array operations.
You want to reuse an existing ONNX model. Spox’s Inline tool is a powerful utility for this.
You want to dispatch to certain specialized custom operators that go beyond the scope of the ONNX standard. This is also achieved via Spox’s Inline tool.

Moving between Spox and ndonnx

ndonnx to Spox

Extract a spox.Var from an ndonnx.Array using the ndonnx.Array.unwrap_spox() method. This allows you to use Spox to manipulate the ONNX graph more directly.

import spox
import ndonnx as ndx
import spox.opset.ai.onnx.v20 as op

x = ndx.argument(shape=(2, 1), dtype=ndx.utf8)

var = x.unwrap_spox()

# Freely use Spox
# StringSplit has two outputs: the substrings and the number of substrings per element
var, num_splits = op.string_split(var, delimiter=",")
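
To make the operator used above more concrete: per the ONNX specification, StringSplit produces two outputs, the substrings (padded with empty strings to a common length) and the number of substrings per element. The following is a minimal pure-Python sketch of that behavior for a flat list of strings and an explicit delimiter. string_split_reference is a hypothetical helper written for illustration and is not part of Spox or ndonnx; edge cases such as empty inputs and maxsplit are not covered.

```python
def string_split_reference(strings, delimiter=","):
    """Approximate ONNX StringSplit for a flat list of strings (illustrative only)."""
    # Split every element on the delimiter
    parts = [s.split(delimiter) for s in strings]
    # Second output: number of substrings produced for each element
    counts = [len(p) for p in parts]
    # First output: substrings, padded with "" so every row has the same length
    width = max(counts, default=0)
    padded = [p + [""] * (width - len(p)) for p in parts]
    return padded, counts


substrings, num_splits = string_split_reference(["a,b,c", "d"])
print(substrings)
print(num_splits)
```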

While this is trivial for data types with a direct correspondence in ONNX, arrays with nullable data types like ndonnx.nutf8 must first be decomposed into their constituent parts. This is achieved via the ndonnx.Array.disassemble() method.

import ndonnx as ndx

x = ndx.argument(shape=(2, 1), dtype=ndx.nutf8)

# Disassemble once and pick out the constituent parts
components = x.disassemble()
data_var = components["data"]
null_var = components["null"]
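
As a mental model for the two parts, a nullable array can be thought of as a plain data array paired with a boolean mask. The sketch below uses NumPy only and assumes the mask is True where a value is missing, in the style of NumPy masked arrays; that convention is an assumption here and the arrays are made-up illustrative values, not output from ndonnx.

```python
import numpy as np

# "data" holds a value for every position, including placeholders for missing ones
data = np.array(["a", "", "c"], dtype=object)
# "null" marks which positions are missing (assumed: True means missing)
null = np.array([False, True, False])

# The pair corresponds to a NumPy masked array
masked = np.ma.masked_array(data, mask=null)

# Values that are actually present
present = data[~null]
print(present)
```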

Spox to ndonnx

Taking the var from the string_split example above, we can convert it back to an ndonnx.Array using the ndonnx.asarray() function.

array = ndx.asarray(var)

Implementing a one-hot-encode function

Spox gives you direct access to the standard ONNX operators. ndonnx does not expose a one_hot_encode function, but the ONNX standard has a specialized operator for this, so we can implement a function that one-hot-encodes ndonnx arrays by going through Spox.

from typing import Iterable
import ndonnx as ndx
import spox.opset.ai.onnx.ml.v3 as ml

def one_hot_encode(x: ndx.Array, categories: Iterable[str]) -> ndx.Array:
    return ndx.asarray(ml.one_hot_encoder(x.unwrap_spox(), cats_strings=categories))

We can use this as normal to export and run an ONNX model.

import numpy as np
import onnx
import onnxruntime as ort
import ndonnx as ndx

x = ndx.argument(shape=("N",), dtype=ndx.utf8)
y = one_hot_encode(x, ["a", "b", "c"])

model = ndx.build({"x": x}, {"y": y})
onnx.save(model, "one_hot_encode.onnx")

sess = ort.InferenceSession("one_hot_encode.onnx")
# Pass the input as a NumPy array of strings
out, = sess.run(None, {"x": np.array(["c", "b", "b", "a"])})
print(out)
# [[0. 0. 1.]
#  [0. 1. 0.]
#  [0. 1. 0.]
#  [1. 0. 0.]]
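
To sanity-check a result like this without running a model, the behavior of the OneHotEncoder operator for string categories can be approximated in NumPy. This is a minimal sketch, assuming the operator's default of producing an all-zero row for a value that is not among the categories. one_hot_reference is a hypothetical helper for illustration and is not part of ndonnx or Spox.

```python
import numpy as np


def one_hot_reference(values, categories):
    """Approximate ONNX OneHotEncoder for string inputs (illustrative only)."""
    values = np.asarray(values, dtype=object)
    cats = np.asarray(list(categories), dtype=object)
    # Compare every value against every category; the new trailing axis
    # indexes the categories. Unknown values match nothing and stay all zero.
    return (values[..., np.newaxis] == cats).astype(np.float32)


print(one_hot_reference(["c", "b", "b", "a"], ["a", "b", "c"]))
```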

Note

See Value propagation with custom operators for how to keep value propagation working when you use custom ONNX operators.