Intel
®
VTune
™
Profiler 2022.2.0
Elapsed Time:
13.524s
GPU Time:
2.148s
EU Array Stalled/Idle:
13.6% of Elapsed time with GPU busy
GPU L3 Bandwidth Bound:
61.5% of peak value
L3 bandwidth was high when EUs were stalled or idle. Consider improving cache reuse.
Hottest GPU Computing Tasks Bound by GPU L3 Bandwidth:
Computing Task
Total Time
Sampler Busy:
0.0% of peak value
Hottest GPU Computing Tasks with High Sampler Usage:
Computing Task
Total Time
FPU Utilization:
18.1% of Elapsed time with GPU busy
Hottest GPU Computing Tasks with High FPU Utilization:
Computing Task
Total Time
Collection and Platform Info:
Application Command Line:
python "lab/gpairs_gpu.py" "--steps" "1" "--size" "65536"
User Name:
u137620
Operating System:
5.4.0-80-generic DISTRIB_ID=Ubuntu DISTRIB_RELEASE=20.04 DISTRIB_CODENAME=focal DISTRIB_DESCRIPTION="Ubuntu 20.04.4 LTS"
Computer Name:
s001-n157
Result Size:
254.4 MB
Collection start time:
03:10:06 05/06/2022 UTC
Collection stop time:
03:10:20 05/06/2022 UTC
Collector Type:
Event-based sampling driver,User-mode sampling and tracing
CPU:
Name:
Intel(R) microarchitecture code named Coffeelake
Frequency:
3.696 GHz
Logical CPU Count:
12
GPU:
Name:
HD Graphics P630
Vendor:
Intel Corporation
EU Count:
24
Max EU Thread Count:
7
Max Core Frequency:
1.200 GHz
GPU OpenCL Info:
Version:
OpenCL 3.0 NEO
Max Compute Units:
24
Max Work Group Size:
256
Local Memory:
65.5 KB
SVM Capabilities: