The percentage of time when the EUs were stalled or idle is high, which has a negative impact on compute-bound applications.
L3 bandwidth was high when EUs were stalled or idle. Consider improving cache reuse.
| Computing Task | Total Time |
|---|---|
| `dppyPy_dppy_py_devfn__5F__5F_main_5F__5F__2E_run_5F_knn_5F_kernel_24_1_2E_array_28_float64_2C__20_2d_2C__20_C_29__2E_array_28_int64_2C__20_1d_2C__20_C_29__2E_array_28_float64_2C__20_2d_2C__20_C_29__2E_int64_2E_int64_2E_int64_2E_array_28_float64_2C__20_1d_2C__20_C_29__2E_array_28_float64_2C__20_2d_2C__20_C_29__2E_int64` | 195.403s |
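
The cache-reuse recommendation can be illustrated with a tiled nearest-neighbor search. The source of the profiled kernel is not shown here, so the following is only a minimal NumPy sketch of the general idea, not the actual `run_knn_kernel` implementation. The function name `knn_indices_tiled`, its parameters, and the default tile size are hypothetical. Training points are processed in fixed-size tiles, and each tile is compared against every query before the next tile is loaded, so the tile stays resident in cache while it is reused.

```python
import numpy as np


def knn_indices_tiled(train, queries, k, tile=256):
    """Return, for each query row, the indices of its k nearest training rows.

    Hypothetical helper: training rows are visited in tiles of `tile` rows,
    and every query is compared against the current tile before moving on.
    """
    n_queries = queries.shape[0]
    # Running best-k squared distances and indices for every query.
    best_d = np.full((n_queries, k), np.inf)
    best_i = np.full((n_queries, k), -1, dtype=np.int64)

    for start in range(0, train.shape[0], tile):
        block = train[start:start + tile]
        # Squared Euclidean distances, shape (n_queries, len(block)).
        d = ((queries[:, None, :] - block[None, :, :]) ** 2).sum(axis=2)
        idx = np.arange(start, start + block.shape[0], dtype=np.int64)

        # Merge this tile's candidates with the running best-k.
        cand_d = np.concatenate([best_d, d], axis=1)
        cand_i = np.concatenate([best_i, np.broadcast_to(idx, d.shape)], axis=1)
        order = np.argsort(cand_d, axis=1, kind="stable")[:, :k]
        best_d = np.take_along_axis(cand_d, order, axis=1)
        best_i = np.take_along_axis(cand_i, order, axis=1)

    return best_i
```

In a GPU kernel the analogous step would be staging each tile in local memory shared by a work-group, but whether that helps this particular kernel is an assumption that would need to be confirmed by re-profiling.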