MBG wiki
|
RecentChanges
|
Blog
|
2024-04-20
|
2024-04-19
Editing NAS Parallel Benchmarks, v.2.3, 4 old nodes
The benchmarks were built with the Intel compilers. MPICH was used as the MPI implementation. Process migration to the newest nodes was avoided by locking the ssh deamon (not the best solution). The nodes were pc13, pc14, pc15, pc16 (733 MHz, PIIIs). Results : ---- == NAS Parallel Benchmarks 2.3 -- BT Benchmark == <pre> No input file inputbt.data. Using compiled defaults Size: 64x 64x 64 Iterations: 200 dt: 0.000800 Number of active processes: 4 Time step 1 Time step 20 Time step 40 Time step 60 Time step 80 Time step 100 Time step 120 Time step 140 Time step 160 Time step 180 Time step 200 Verification being performed for class A accuracy setting for epsilon = 0.1000000000000E-07 Comparison of RMS-norms of residual 1 0.1080634671464E+03 0.1080634671464E+03 0.6969749535109E-14 2 0.1131973090122E+02 0.1131973090122E+02 0.1255405701709E-14 3 0.2597435451158E+02 0.2597435451158E+02 0.3282665917769E-14 4 0.2366562254468E+02 0.2366562254468E+02 0.8406791988557E-14 5 0.2527896321175E+03 0.2527896321175E+03 0.1394160013545E-13 Comparison of RMS-norms of solution error 1 0.4234841604053E+01 0.4234841604053E+01 0.1468118413674E-14 2 0.4439028249700E+00 0.4439028249700E+00 0.6252624235385E-14 3 0.9669248013635E+00 0.9669248013635E+00 0.3789059889763E-14 4 0.8830206303977E+00 0.8830206303977E+00 0.0000000000000E+00 5 0.9737990177083E+01 0.9737990177083E+01 0.2553811957023E-14 Verification Successful BT Benchmark Completed. Class = A Size = 64x 64x 64 Iterations = 200 Time in seconds = 917.25 Total processes = 4 Compiled procs = 4 Mop/s total = 183.47 Mop/s/process = 45.87 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 Compile options: MPIF77 = mpif77 FLINK = mpif77 FMPI_LIB = -L/usr/local/lib FMPI_INC = -I/usr/local/include FFLAGS = -O2 FLINKFLAGS = -static RAND = (none) Please send the results of this run to: NPB Development Team Internet: npb@nas.nasa.gov If email is not available, send this to: MS T27A-1 NASA Ames Research Center Moffett Field, CA 94035-1000 Fax: 415-604-3957 </pre> ---- == NAS Parallel Benchmarks 2.3 -- CG Benchmark == <pre> Size: 14000 Iterations: 15 Number of active processes: 4 iteration ||r|| zeta 1 0.14674662350377E-12 19.9997581277040 2 0.13825296499213E-14 17.1140495745506 3 0.13582200639364E-14 17.1296668946143 4 0.13390743666272E-14 17.1302113581192 5 0.13151128694994E-14 17.1302338856353 6 0.12717775790555E-14 17.1302349879482 7 0.12434040923854E-14 17.1302350498916 8 0.12169541498574E-14 17.1302350537510 9 0.11851688069578E-14 17.1302350540101 10 0.11493047350287E-14 17.1302350540284 11 0.11175026684273E-14 17.1302350540298 12 0.10968717022187E-14 17.1302350540299 13 0.10439446919306E-14 17.1302350540299 14 0.10142396619043E-14 17.1302350540299 15 0.98043441651967E-15 17.1302350540299 Benchmark completed VERIFICATION SUCCESSFUL Zeta is 0.171302350540E+02 Error is 0.891731133379E-12 CG Benchmark Completed. Class = A Size = 14000 Iterations = 15 Time in seconds = 20.69 Total processes = 4 Compiled procs = 4 Mop/s total = 72.33 Mop/s/process = 18.08 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.3 -- EP Benchmark == <pre> Number of random numbers generated: 536870912 Number of active processes: 4 EP Benchmark Results: CPU Time = 35.0941 N = 2^ 28 No. Gaussian Pairs = 210832767. Sums = -4.295875165634738D+03 -1.580732573678648D+04 Counts: 0 98257395. 1 93827014. 2 17611549. 3 1110028. 4 26536. 5 245. 6 0. 7 0. 8 0. 9 0. EP Benchmark Completed. Class = A Size = 536870912 Iterations = 0 Time in seconds = 35.09 Total processes = 4 Compiled procs = 4 Mop/s total = 15.30 Mop/s/process = 3.82 Operation type = Random numbers generated Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.3 -- FT Benchmark == <pre> No input file inputft.data. Using compiled defaults Size : 256x256x128 Iterations : 6 Number of processes : 4 Processor array : 1x 4 Layout type : 1D T = 1 Checksum = 5.046735008193D+02 5.114047905510D+02 T = 2 Checksum = 5.059412319734D+02 5.098809666433D+02 T = 3 Checksum = 5.069376896287D+02 5.098144042213D+02 T = 4 Checksum = 5.077892868474D+02 5.101336130759D+02 T = 5 Checksum = 5.085233095391D+02 5.104914655194D+02 T = 6 Checksum = 5.091487099959D+02 5.107917842803D+02 Result verification successful class = A FT Benchmark Completed. Class = A Size = 256x256x128 Iterations = 6 Time in seconds = 64.84 Total processes = 4 Compiled procs = 4 Mop/s total = 110.07 Mop/s/process = 27.52 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.3 -- IS Benchmark == <pre> Size: 8388608 (class A) Iterations: 10 Number of processes: 4 iteration 1 2 3 4 5 6 7 8 9 10 IS Benchmark Completed Class = A Size = 8388608 Iterations = 10 Time in seconds = 21.74 Total processes = 4 Compiled procs = 4 Mop/s total = 3.86 Mop/s/process = 0.96 Operation type = keys ranked Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.2 -- LU Benchmark == <pre> Size: 64x 64x 64 Iterations: 250 Number of processes: 4 Time step 1 Time step 20 Time step 40 Time step 60 Time step 80 Time step 100 Time step 120 Time step 140 Time step 160 Time step 180 Time step 200 Time step 220 Time step 240 Time step 250 Verification being performed for class A Accuracy setting for epsilon = 0.1000000000000E-07 Comparison of RMS-norms of residual 1 0.7790210760669E+03 0.7790210760669E+03 0.1503135748836E-13 2 0.6340276525969E+02 0.6340276525969E+02 0.4258587752173E-14 3 0.1949924972729E+03 0.1949924972729E+03 0.9328509706709E-14 4 0.1784530116042E+03 0.1784530116042E+03 0.9556031307595E-15 5 0.1838476034946E+04 0.1838476034946E+04 0.1100708234961E-13 Comparison of RMS-norms of solution error 1 0.2996408568547E+02 0.2996408568547E+02 0.1067091565711E-14 2 0.2819457636500E+01 0.2819457636500E+01 0.1323073386331E-13 3 0.7347341269878E+01 0.7347341269877E+01 0.5802447794332E-14 4 0.6713922568778E+01 0.6713922568778E+01 0.2645780944304E-15 5 0.7071531568839E+02 0.7071531568839E+02 0.1044985005012E-13 Comparison of surface integral 0.2603092560489E+02 0.2603092560489E+02 0.2729609951429E-15 Verification Successful LU Benchmark Completed. Class = A Size = 64x 64x 64 Iterations = 250 Time in seconds = 513.00 Total processes = 4 Compiled procs = 4 Mop/s total = 232.55 Mop/s/process = 58.14 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.3 -- MG Benchmark == <pre> No input file. Using compiled defaults Size: 256x256x256 (class A) Iterations: 4 Number of processes: 4 Initialization time: 11.724 seconds Benchmark completed VERIFICATION SUCCESSFUL L2 Norm is 0.243336530907E-05 Error is 0.694855007951E-16 MG Benchmark Completed. Class = A Size = 256x256x256 Iterations = 4 Time in seconds = 27.39 Total processes = 4 Compiled procs = 4 Mop/s total = 142.08 Mop/s/process = 35.52 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre> ---- == NAS Parallel Benchmarks 2.3 -- SP Benchmark == <pre> No input file inputsp.data. Using compiled defaults Size: 64x 64x 64 Iterations: 400 dt: 0.001500 Number of active processes: 4 Time step 1 Time step 20 Time step 40 Time step 60 Time step 80 Time step 100 Time step 120 Time step 140 Time step 160 Time step 180 Time step 200 Time step 220 Time step 240 Time step 260 Time step 280 Time step 300 Time step 320 Time step 340 Time step 360 Time step 380 Time step 400 Verification being performed for class A accuracy setting for epsilon = 0.1000000000000E-07 Comparison of RMS-norms of residual 1 0.2479982239930E+01 0.2479982239930E+01 0.9633939753781E-13 2 0.1127633796437E+01 0.1127633796437E+01 0.8624744776582E-13 3 0.1502897788877E+01 0.1502897788877E+01 0.6293907835791E-13 4 0.1421781621170E+01 0.1421781621170E+01 0.4997551840455E-13 5 0.2129211303514E+01 0.2129211303514E+01 0.5318530308612E-13 Comparison of RMS-norms of solution error 1 0.1090014029782E-03 0.1090014029782E-03 0.3834445484860E-12 2 0.3734395176929E-04 0.3734395176928E-04 0.1159500326349E-12 3 0.5009278540654E-04 0.5009278540654E-04 0.9293340428823E-13 4 0.4767109393954E-04 0.4767109393953E-04 0.2105184825798E-12 5 0.1362161339921E-03 0.1362161339921E-03 0.6287946157047E-13 Verification Successful SP Benchmark Completed. Class = A Size = 64x 64x 64 Iterations = 400 Time in seconds = 704.50 Total processes = 4 Compiled procs = 4 Mop/s total = 120.67 Mop/s/process = 30.17 Operation type = floating point Verification = SUCCESSFUL Version = 2.3 Compile date = 30 Jun 2004 </pre>
Summary:
This change is a minor edit.
Username:
Replace this text with a file.