Recommendations:
GPU Time, % of Elapsed time: 29.6%
GPU utilization is low. Switch to the for in-depth analysis of host activity. Poor GPU utilization can prevent the application from offloading effectively.
XVE Array Stalled/Idle: 56.6% of Elapsed time with GPU busy
GPU metrics detect some kernel issues. Use GPU Compute/Media Hotspots (preview) to understand how well your application runs on the specified hardware.
GPU utilization is low. Consider offloading more work to the GPU to increase overall application performance.

| GPU Adapter | GPU Engine | GPU Time | GPU Time, % of Elapsed time(%) |
|---|---|---|---|
| GPU 1 | Render and GPGPU | 3.927s | 29.6% |
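
The two GPU figures above imply an elapsed time that the report does not print directly. The sketch below is plain arithmetic on those table values, not VTune output; the variable names are illustrative, and the results are approximate because both inputs are rounded.

```python
# Values copied from the GPU time table above.
gpu_time_s = 3.927          # GPU Time for GPU 1, in seconds
gpu_time_fraction = 0.296   # GPU Time, % of Elapsed time (29.6%)

# Elapsed time implied by the two figures (approximate: inputs are rounded).
elapsed_s = gpu_time_s / gpu_time_fraction

# Portion of the elapsed time during which the GPU engine was not busy.
gpu_idle_s = elapsed_s - gpu_time_s

print(f"implied elapsed time: {elapsed_s:.1f} s")
print(f"GPU not busy for:     {gpu_idle_s:.1f} s")
```

Under these assumptions the run lasted roughly 13.3 s, with the GPU not busy for about 9.3 s of it.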

| Function | Module | CPU Time |
|---|---|---|
| asm_exc_page_fault | vmlinux | 4.544s |
| [Skipped stack frame(s)] | [Unknown] | 1.836s |
| _raw_spin_lock | vmlinux | 1.600s |
| asm_exc_int3 | vmlinux | 1.400s |
| memcmp | libc-dynamic.so | 1.300s |
| [Others] | N/A | 18.163s |

| Host Task | Task Time | % of Elapsed Time(%) | Task Count |
|---|---|---|---|
| zeEventHostSynchronize | 3.942s | 29.7% | 102 |
| zeCommandListAppendMemoryCopy | 0.374s | 2.8% | 5 |
| zeModuleCreate | 0.088s | 0.7% | 1 |
| zeCommandListAppendLaunchKernel | 0.001s | 0.0% | 100 |
| zeCommandListCreateImmediate | 0.001s | 0.0% | 3 |
| [Others] | 0.000s | 0.0% | 5 |
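
Dividing each task time by its task count gives a rough per-call cost, which makes the table easier to read. The sketch below only reuses the numbers from the host task table; the `mean_ms` helper and the variable names are hypothetical, and the `[Others]` row is left out because its reported time is 0.000s.

```python
# (task name, total task time in seconds, task count), copied from the table above.
host_tasks = [
    ("zeEventHostSynchronize", 3.942, 102),
    ("zeCommandListAppendMemoryCopy", 0.374, 5),
    ("zeModuleCreate", 0.088, 1),
    ("zeCommandListAppendLaunchKernel", 0.001, 100),
    ("zeCommandListCreateImmediate", 0.001, 3),
]


def mean_ms(total_s, count):
    """Average time per call in milliseconds (illustrative helper)."""
    return total_s / count * 1000.0


averages = {name: mean_ms(total_s, count) for name, total_s, count in host_tasks}

for name, ms in averages.items():
    print(f"{name:<34} {ms:8.3f} ms per call")
```

By this estimate each `zeEventHostSynchronize` call blocks for about 39 ms on average, while a kernel launch is submitted in about 0.01 ms. That suggests the host spends this time waiting for GPU work to complete rather than on submission overhead.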

| Computing Task | Total Time | Execution Time | % of Total Time(%) | SIMD Width |
|---|---|---|---|---|
| `iso3dfd(sycl::_V1::queue&, float*, float*, float*, float*, unsigned long, unsigned long, unsigned long, unsigned long, unsigned long, unsigned long, unsigned long)::{lambda()#1}::operator()<sycl::_V1::handler>(, signed char) const::{lambda()#1}` | 3.923s | 3.921s | 100.0% | 32 |
| [Outside any task] | 0.752s | 0s | 0.0% | |