Intel® VTune Profiler 2022.0.0

Recommendations:

GPU Time, % of Elapsed time: 20.0%
GPU utilization is low. Switch to the GPU Offload analysis for in-depth analysis of host activity. Poor GPU utilization can prevent the application from offloading effectively.
EU Array Stalled/Idle: 39.8% of Elapsed time with GPU busy
GPU metrics detect some kernel issues. Use GPU Compute/Media Hotspots (preview) to understand how well your application runs on the specified hardware.
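
The two metrics above use different denominators: GPU Time is a share of total elapsed time, while EU Array Stalled/Idle is a share of only the time the GPU was busy. As a rough sketch of how they combine, the snippet below estimates how much of the total run the EU array was actually active. The function name `eu_active_share` is hypothetical and not part of VTune, and the calculation assumes the stalled/idle figure is a simple average over GPU-busy time.

```python
def eu_active_share(gpu_time_pct: float, eu_stalled_idle_pct: float) -> float:
    """Estimate EU-active time as a percentage of total elapsed time.

    gpu_time_pct: "GPU Time, % of Elapsed time" from the summary.
    eu_stalled_idle_pct: "EU Array Stalled/Idle", measured only over
        the elapsed time in which the GPU was busy.
    Assumption: the stalled/idle figure is a plain average over GPU-busy time.
    """
    gpu_busy = gpu_time_pct / 100.0
    eu_active_when_busy = 1.0 - eu_stalled_idle_pct / 100.0
    return 100.0 * gpu_busy * eu_active_when_busy


# Values taken from the summary above.
estimate = eu_active_share(20.0, 39.8)
print(f"{estimate:.1f}% of elapsed time")  # prints "12.0% of elapsed time"
```

Under these assumptions, the EU array is doing useful work for only about 12% of the run. That fits both recommendations: most of the lost time is on the host side, and the kernels themselves also leave headroom.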