# ONNX Import

## Introduction
As deep learning evolves, interoperability between frameworks becomes crucial. Burn provides robust
support for importing ONNX (Open Neural Network Exchange) models through the `burn-onnx` crate,
enabling you to leverage pre-trained models in your Rust-based deep learning projects.
## Why Import Models?
Importing pre-trained models offers several advantages:
- Time-saving: Skip the resource-intensive process of training models from scratch.
- Access to state-of-the-art architectures: Utilize cutting-edge models developed by researchers and industry leaders.
- Transfer learning: Fine-tune imported models for your specific tasks, benefiting from knowledge transfer.
- Consistency across frameworks: Maintain consistent performance when moving between frameworks.
## Understanding ONNX
ONNX (Open Neural Network Exchange) is an open format designed to represent machine learning models with these key features:
- Framework agnostic: Provides a common format that works across various deep learning frameworks.
- Comprehensive representation: Captures both the model architecture and trained weights.
- Wide support: Compatible with popular frameworks like PyTorch, TensorFlow, and scikit-learn.
This standardization allows seamless movement of models between different frameworks and deployment environments.
## Burn's ONNX Support
Burn's approach to ONNX import offers unique advantages:
- Native Rust code generation: Translates ONNX models into Rust source code for deep integration with Burn's ecosystem.
- Compile-time optimization: Leverages the Rust compiler to optimize the generated code, potentially improving performance.
- No runtime dependency: Eliminates the need for an ONNX runtime, unlike many other solutions.
- Trainability: Allows imported models to be further trained or fine-tuned using Burn.
- Portability: Enables compilation for various targets, including WebAssembly and embedded devices.
- Backend flexibility: Works with any of Burn's supported backends.
## ONNX Compatibility
Burn recommends ONNX models use opset version 16 or higher for best compatibility. While models with older opset versions may work, opset 16+ ensures access to all supported operators and their latest behavior. If you encounter issues with an older model, consider upgrading it using the ONNX version converter.
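If you are unsure which opset a model targets, you can inspect it before deciding whether an upgrade is needed. A minimal sketch, assuming the `onnx` Python package is installed:

```python
import onnx

# Print the opset version for each operator-set domain the model imports
model = onnx.load('path/to/your/model.onnx')
for opset in model.opset_import:
    # The default ONNX operator set uses an empty domain string
    print(f"{opset.domain or 'ai.onnx'}: opset {opset.version}")
```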
### Upgrading ONNX Models
There are two simple ways to upgrade your ONNX models to the recommended opset version:
**Option 1:** Use the provided utility script:
```sh
uv run --script https://raw.githubusercontent.com/tracel-ai/burn-onnx/refs/heads/main/onnx_opset_upgrade.py
```
**Option 2:** Use a custom Python script:
```python
import onnx
from onnx import version_converter, shape_inference

# Load your ONNX model
model = onnx.load('path/to/your/model.onnx')

# Convert the model to opset version 16
upgraded_model = version_converter.convert_version(model, 16)

# Apply shape inference to the upgraded model
inferred_model = shape_inference.infer_shapes(upgraded_model)

# Save the converted model
onnx.save(inferred_model, 'upgraded_model.onnx')
```
## Step-by-Step Guide
Follow these steps to import an ONNX model into your Burn project:
### Step 1: Update `Cargo.toml`

First, add the required dependencies to your `Cargo.toml`:
```toml
[dependencies]
burn = { version = "~0.21", features = ["flex"] }

[build-dependencies]
burn-onnx = "~0.21"
```
### Step 2: Update `build.rs`

In your `build.rs` file:
```rust
use burn_onnx::ModelGen;

fn main() {
    ModelGen::new()
        .input("src/model/my_model.onnx")
        .out_dir("model/")
        .run_from_script();
}
```
This generates Rust code and a `.bpk` weights file from your ONNX model during the build process.
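For the example above, the build emits files along these lines into `OUT_DIR` (names are illustrative; they follow the input file's stem):

```
$OUT_DIR/model/my_model.rs   # generated Rust module
$OUT_DIR/model/my_model.bpk  # Burnpack weights
```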
### Step 3: Modify `mod.rs`

In your `src/model/mod.rs` file, include the generated code:
```rust
pub mod my_model {
    include!(concat!(env!("OUT_DIR"), "/model/my_model.rs"));
}
```
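For reference, a typical source layout implied by the steps so far (file names are illustrative):

```
my-project/
├── Cargo.toml
├── build.rs
└── src/
    ├── main.rs
    └── model/
        ├── mod.rs
        └── my_model.onnx
```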
### Step 4: Use the Imported Model
Now you can use the imported model in your code:
```rust
use burn::tensor;
use burn::backend::{Flex, flex::FlexDevice};

use model::my_model::Model;

fn main() {
    let device = FlexDevice::default();

    // Create the model instance and load its weights (generated into OUT_DIR)
    // onto the default device
    let model: Model<Flex> = Model::default();

    // Create an input tensor (replace with your actual input)
    let input = tensor::Tensor::<Flex, 4>::zeros([1, 3, 224, 224], &device);

    // Perform inference
    let output = model.forward(input);

    println!("Model output: {:?}", output);
}
```
## Advanced Configuration

The `ModelGen` struct provides several configuration options:
```rust
use burn_onnx::{ModelGen, LoadStrategy};

ModelGen::new()
    .input("path/to/model.onnx")
    .out_dir("model/")
    .development(true) // Enable development mode for debugging
    .load_strategy(LoadStrategy::Embedded) // Embed weights in the binary
    .run_from_script();
```
- `input`: Path to the ONNX model file
- `out_dir`: Output directory for generated code and weights
- `development`: When enabled, generates additional debug files (`.onnx.txt`, `.graph.txt`)
- `load_strategy`: Controls which weight-loading constructors are generated on the `Model` struct (see below)
Model weights are stored in Burnpack format (`.bpk`), which provides efficient serialization and
loading.
### Load Strategy

The `LoadStrategy` enum controls how the generated model loads its weights:
| Strategy   | Generated constructors            | Default impl | Use case                          |
|------------|-----------------------------------|--------------|-----------------------------------|
| `File`     | `from_file()`, `from_bytes()`     | Yes          | Standard desktop/server (default) |
| `Embedded` | `from_embedded()`, `from_bytes()` | Yes          | Single binary, small models       |
| `Bytes`    | `from_bytes()`                    | No           | WASM, embedded, custom loaders    |
| `None`     | (none)                            | No           | Manual weight management          |
The default strategy is `File`, which keeps weights in a separate `.bpk` file and generates a
`from_file()` constructor.
For WebAssembly or environments without filesystem access, use `LoadStrategy::Bytes`:
```rust
ModelGen::new()
    .input("model.onnx")
    .out_dir("model/")
    .load_strategy(LoadStrategy::Bytes)
    .run_from_script();
```
Then load weights at runtime from any byte source (e.g., a network fetch):
```rust
let model = Model::<Backend>::from_bytes(weight_bytes, &device);
```
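As one concrete illustration outside the browser, the bytes could come from a plain file read. A minimal sketch; the path is hypothetical, and the exact byte argument type follows the generated `from_bytes` signature:

```rust
// Hypothetical path; in WASM these bytes would come from a fetch instead
let weight_bytes = std::fs::read("assets/my_model.bpk").expect("failed to read weights");
let device = Default::default();
let model = Model::<Backend>::from_bytes(weight_bytes, &device);
```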
## Loading and Using Models

You can load models in several ways, depending on the `LoadStrategy` used during code generation:
```rust
// Load from the output directory with the default device (recommended for most use cases).
// This automatically loads weights from the .bpk file.
// Available with LoadStrategy::File or LoadStrategy::Embedded.
let model = Model::<Backend>::default();

// Create a new model instance with a specific device
// (initializes weights randomly; load weights via `load_from` afterward)
let model = Model::<Backend>::new(&device);

// Load from a specific .bpk file (LoadStrategy::File)
let model = Model::<Backend>::from_file("path/to/weights.bpk", &device);

// Load from in-memory bytes (LoadStrategy::File, Embedded, or Bytes)
let model = Model::<Backend>::from_bytes(weight_bytes, &device);

// Load from embedded weights (LoadStrategy::Embedded)
let model = Model::<Backend>::from_embedded(&device);
```
## Troubleshooting

Common issues and solutions:

- **Unsupported ONNX operator**: Check the list of supported ONNX operators. You may need to
  simplify your model or wait for support.
- **Build errors**: Ensure your `burn-onnx` version matches your Burn version and verify the ONNX
  file path in `build.rs`.
- **Runtime errors**: Confirm that your input tensors match the expected shape and data type of
  your model (see the sanity-check sketch after this list).
- **Performance issues**: Consider using a more performant backend or optimizing your model
  architecture.
- **Viewing generated files**: Find the generated Rust code and weights in the `OUT_DIR` directory
  (usually `target/debug/build/<project>/out`).
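For the runtime-errors item, a quick sanity check is to print the input tensor's dimensions before calling `forward`. The 4-D image shape below is illustrative and should be matched to your own model:

```rust
// Illustrative input; adjust the rank and sizes to what your model expects
let input = Tensor::<Backend, 4>::zeros([1, 3, 224, 224], &device);
println!("input dims: {:?}", input.dims());
let output = model.forward(input);
```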
## Examples and Resources

For practical examples, check out the `burn-onnx` examples:
- ONNX Inference - MNIST inference example
- Image Classification Web - SqueezeNet running in the browser via WebAssembly
- Raspberry Pi Pico - Embedded deployment example
These demonstrate real-world usage of ONNX import in Burn projects.
For contributors looking to add support for new ONNX operators:
- Development Guide - Step-by-step guide for implementing new operators
## Conclusion
Importing ONNX models into Burn combines the vast ecosystem of pre-trained models with Burn's performance and Rust's safety features. Following this guide, you can seamlessly integrate ONNX models into your Burn projects for inference, fine-tuning, or further development.
The `burn-onnx` crate is actively developed, with ongoing work to support more ONNX operators and
improve performance. Visit the burn-onnx repository for updates and to contribute!