ONNX (Open Neural Network Exchange) is an open file format for trained models. A model exported from its original framework, such as PyTorch, can run on any runtime that supports ONNX, across different hardware and different serving engines. It is a common bridge between research code and a production runtime.