Recommendations:
GPU Time, % of Elapsed time: 75.2%GPU utilization is low. Switch to the for in-depth analysis of host activity. Poor GPU utilization can prevent the application from offloading effectively.XVE Array Stalled/Idle: 52.4% of Elapsed time with GPU busy
GPU metrics detect some kernel issues. Use GPU Compute/Media Hotspots (preview) to understand how well your application runs on the specified hardware.
GPU utilization is low. Consider offloading more work to the GPU to increase overall application performance.
| GPU Adapter | GPU Engine | GPU Time | GPU Time, % of Elapsed time(%) |
|---|---|---|---|
| GPU 1 | Render and GPGPU | 27.734s | 75.2% |
| Function | Module | CPU Time |
|---|---|---|
| asm_exc_page_fault | vmlinux | 2.920s |
| [Skipped stack frame(s)] | [Unknown] | 1.644s |
| operator new | libc++abi.so | 1.484s |
| func@0x13f9b0 | libze_intel_gpu.so.1.3.27191.42 | 1.434s |
| memcmp | libc-dynamic.so | 1.420s |
| [Others] | N/A | 15.764s |
| Host Task | Task Time | % of Elapsed Time(%) | Task Count |
|---|---|---|---|
| zeEventHostSynchronize | 21.643s | 58.7% | 14 |
| zeCommandListAppendMemoryCopy | 6.066s | 16.4% | 1 |
| zeModuleCreate | 0.259s | 0.7% | 1 |
| zeCommandListAppendMemoryCopyRegion | 0.071s | 0.2% | 5 |
| zeCommandListCreateImmediate | 0.001s | 0.0% | 3 |
| [Others] | 0.001s | 0.0% | 105 |
| Computing Task | Total Time | Execution Time | % of Total Time(%) | SIMD Width |
|---|---|---|---|---|
| iso3dfd(sycl::_V1::queue&, float*, float*, float*, float*, unsigned long, unsigned long, unsigned long, unsigned long)::{lambda(sycl::_V1::handler&)#1}::operator()(sycl::_V1::handler&) const::{lambda(sycl::_V1::id<(int)3>)#1} | 27.791s | 14.766s | 53.1% | 32 |