
Support additional Execution Providers in ONNX wasi-nn backend #8547

Open
kaivol opened this issue May 3, 2024 · 1 comment
kaivol commented May 3, 2024

Feature

Currently, the ONNX backend in wasmtime-wasi-nn always uses the default CPU execution provider and ignores the ExecutionTarget requested by the WASM caller:

fn load(&mut self, builders: &[&[u8]], target: ExecutionTarget) -> Result<Graph, BackendError> {
    if builders.len() != 1 {
        return Err(BackendError::InvalidNumberOfBuilders(1, builders.len()).into());
    }
    // The session is always built with ONNX Runtime's default (CPU) execution
    // provider; `target` is stored on the graph but never used to configure one.
    let session = Session::builder()?
        .with_optimization_level(GraphOptimizationLevel::Level3)?
        .with_model_from_memory(builders[0])?;
    let box_: Box<dyn BackendGraph> =
        Box::new(ONNXGraph(Arc::new(Mutex::new(session)), target));
    Ok(box_.into())
}

I would like to suggest adding support for additional execution providers (CUDA, TensorRT, ROCm, ...) to wasmtime-wasi-nn.

Benefit

Improved performance for WASM modules using the wasi-nn API.

Implementation

ort already has support for many execution providers, so integrating these into wasmtime-wasi-nn should not be too much work.
I would be interested in looking into this; however, I only really have the means to test the DirectML and NVIDIA CUDA / TensorRT EPs.
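
For illustration only, here is a rough, untested sketch of how load could map the requested target onto ort execution providers. It assumes ort 2.x's with_execution_providers API and its CUDAExecutionProvider / TensorRTExecutionProvider / DirectMLExecutionProvider types (each gated behind the corresponding crate feature), as well as the wasi-nn ExecutionTarget variants Cpu / Gpu / Tpu; the exact provider list, ordering, and fallback behaviour would need discussion:

// Rough sketch (not tested): choose ort execution providers from the wasi-nn target.
// Assumes ort 2.x with the relevant crate features ("cuda", "tensorrt", "directml")
// enabled; providers that cannot be loaded at runtime are expected to fall back to CPU.
fn load(&mut self, builders: &[&[u8]], target: ExecutionTarget) -> Result<Graph, BackendError> {
    if builders.len() != 1 {
        return Err(BackendError::InvalidNumberOfBuilders(1, builders.len()).into());
    }

    let mut builder = Session::builder()?
        .with_optimization_level(GraphOptimizationLevel::Level3)?;

    builder = match target {
        // CPU: keep ONNX Runtime's default provider.
        ExecutionTarget::Cpu => builder,
        // GPU: register GPU-capable providers; which ones to enable (CUDA, TensorRT,
        // DirectML, ROCm, ...) is exactly the open question of this issue.
        ExecutionTarget::Gpu => builder.with_execution_providers([
            ort::TensorRTExecutionProvider::default().build(),
            ort::CUDAExecutionProvider::default().build(),
            ort::DirectMLExecutionProvider::default().build(),
        ])?,
        // TPU: no obvious ONNX Runtime provider; fall back to CPU for now
        // (a real implementation might return an error instead).
        ExecutionTarget::Tpu => builder,
    };

    let session = builder.with_model_from_memory(builders[0])?;
    let box_: Box<dyn BackendGraph> =
        Box::new(ONNXGraph(Arc::new(Mutex::new(session)), target));
    Ok(box_.into())
}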

Alternatives

Leave it to the users to add support for additional execution providers.

abrown commented Jun 11, 2024

I was looking at old issues and ran across this one (sorry for such a late reply!): I completely agree with this idea. I am tempted to say "go for it!" but maybe there is some coordination needed. E.g., I think @jianjunz has started enabling some DirectML bits in #8756. And @devigned may have some opinions on the best way to do this. But from my perspective, this seems like a worthwhile avenue to pursue.
