The percentage of time when the XVEs were stalled or idle is high, which has a negative impact on compute-bound applications.
| GPU Stack | GPU Adapter | XVE Array Active(%) | XVE Array Stalled(%) | XVE Array Idle(%) |
|---|---|---|---|---|
| 0 | GPU 1 | 0.0% | 0.0% | 100.0% |
| 0 | GPU 3 | 0.0% | 0.0% | 100.0% |
| 0 | GPU 0 | 21.4% | 16.3% | 62.3% |
| 0 | GPU 2 | 0.0% | 0.0% | 100.0% |
Several factors including shared local memory, use of memory barriers, and inefficient work scheduling can cause a low value of the occupancy metric.
| Computing Task | Total Time | Occupancy(%) | SIMD Utilization(%) |
|---|---|---|---|
| iso3dfd(sycl::_V1::queue&, float*, float*, float*, float*, unsigned long, unsigned long, unsigned long, unsigned long)::{lambda(sycl::_V1::handler&)#1}::operator()(sycl::_V1::handler&) const::{lambda(sycl::_V1::id<(int)3>)#1} | 6.713s | 21.3% of peak value | 25.0% |