Intel® VTune Profiler 2022.0.0

Recommendations:

GPU Time, % of Elapsed time: 23.2%
GPU utilization is low. Switch to the for in-depth analysis of host activity. Poor GPU utilization can prevent the application from offloading effectively.
EU Array Stalled/Idle: 22.2% of Elapsed time with GPU busy
GPU metrics detect some kernel issues. Use GPU Compute/Media Hotspots (preview) to understand how well your application runs on the specified hardware.