================================================================================================
Benchmark for performance of JSON parsing
================================================================================================

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
JSON schema inferring:                    Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        2240           2300          98          2.2         448.0       1.0X
UTF-8 is set                                       3325           3333           8          1.5         665.0       0.7X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
count a short column:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        1890           1917          23          2.6         378.1       1.0X
UTF-8 is set                                       3155           3158           3          1.6         630.9       0.6X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
count a wide column:                      Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        5079           5266         188          0.2        5078.9       1.0X
UTF-8 is set                                       4272           4280           6          0.2        4272.5       1.2X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
select wide row:                          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
No encoding                                        9614           9866         271          0.0      192271.0       1.0X
UTF-8 is set                                      10517          10608          80          0.0      210331.2       0.9X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Select a subset of 10 columns:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Select 10 columns                                  1661           1666           6          0.6        1660.6       1.0X
Select 1 column                                    1078           1081           2          0.9        1078.3       1.5X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
creation of JSON parser per line:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Short column without encoding                       604            612          10          1.7         604.3       1.0X
Short column with UTF-8                             828            839          15          1.2         828.3       0.7X
Wide column without encoding                       7212           7255          38          0.1        7212.1       0.1X
Wide column with UTF-8                             7446           7462          15          0.1        7445.8       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
JSON functions:                           Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                            60             63           2         16.6          60.1       1.0X
from_json                                          1168           1175           7          0.9        1168.4       0.1X
json_tuple                                         1158           1170          16          0.9        1158.4       0.1X
get_json_object wholestage off                     1075           1081           6          0.9        1074.8       0.1X
get_json_object wholestage on                      1018           1029          13          1.0        1018.1       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Dataset of json strings:                  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           232            238          10         21.6          46.4       1.0X
schema inferring                                   1919           1928           9          2.6         383.7       0.1X
parsing                                            2717           2724           7          1.8         543.4       0.1X

Preparing data for benchmarking ...
OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Json files in the per-line mode:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Text read                                           562            569           7          8.9         112.5       1.0X
Schema inferring                                   2424           2432           9          2.1         484.8       0.2X
Parsing without charset                            2808           2810           3          1.8         561.7       0.2X
Parsing with UTF-8                                 3993           4001          12          1.3         798.5       0.1X

OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Write dates and timestamps:               Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
Create a dataset of timestamps                      101            108           7          9.9         101.4       1.0X
to_json(timestamp)                                  705            707           2          1.4         704.6       0.1X
write timestamps to files                           598            611          20          1.7         598.3       0.2X
Create a dataset of dates                           112            118          10          8.9         111.9       0.9X
to_json(date)                                       546            548           2          1.8         546.3       0.2X
write dates to files                                393            399           9          2.5         393.1       0.3X

OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Read dates and timestamps:                                             Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
-----------------------------------------------------------------------------------------------------------------------------------------------------
read timestamp text from files                                                   149            153           4          6.7         149.4       1.0X
read timestamps from files                                                      1044           1049           5          1.0        1043.6       0.1X
infer timestamps from files                                                     1973           1983          12          0.5        1972.7       0.1X
read date text from files                                                        140            143           5          7.1         140.0       1.1X
read date from files                                                             690            698           9          1.4         690.3       0.2X
timestamp strings                                                                141            149           7          7.1         140.8       1.1X
parse timestamps from Dataset[String]                                           1265           1266           2          0.8        1264.5       0.1X
infer timestamps from Dataset[String]                                           2160           2169          12          0.5        2160.5       0.1X
date strings                                                                     248            250           2          4.0         248.3       0.6X
parse dates from Dataset[String]                                                1010           1015           6          1.0        1009.6       0.1X
from_json(timestamp)                                                            1781           1810          27          0.6        1781.1       0.1X
from_json(date)                                                                 1510           1514           4          0.7        1510.0       0.1X
infer error timestamps from Dataset[String] with default format                 1412           1420           8          0.7        1412.2       0.1X
infer error timestamps from Dataset[String] with user-provided format           1372           1378           6          0.7        1371.6       0.1X
infer error timestamps from Dataset[String] with legacy format                  1427           1439          18          0.7        1426.6       0.1X

OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Filters pushdown:                         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
w/o filters                                        6100           6124          33          0.0       61003.7       1.0X
pushdown disabled                                  5957           5981          31          0.0       59569.9       1.0X
w/ filters                                          729            737           8          0.1        7291.0       8.4X

OpenJDK 64-Bit Server VM 21.0.4+7-LTS on Linux 6.5.0-1025-azure
AMD EPYC 7763 64-Core Processor
Partial JSON results:                     Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
parse invalid JSON                                 2476           2480           5          0.0      247550.8       1.0X


