cclm driven by ERA5, int2lm reports error – in #10: INT2LM



  @redc_migration in #48dba0b


more detailed error info:

YDATE_INI 1979010100
----- start INT2CLM
0: cpu-bind=MASK - m10699, task 0 0 [8437]: mask 0x1000001 set
23: cpu-bind=MASK - m10699, task 23 23 [8460]: mask 0x800000800000 set
14: cpu-bind=MASK - m10699, task 14 14 [8451]: mask 0x80000080 set
22: cpu-bind=MASK - m10699, task 22 22 [8459]: mask 0x800000800 set
6: cpu-bind=MASK - m10699, task 6 6 [8443]: mask 0x8000008 set
15: cpu-bind=MASK - m10699, task 15 15 [8452]: mask 0x80000080000 set
20: cpu-bind=MASK - m10699, task 20 20 [8457]: mask 0x400000400 set
7: cpu-bind=MASK - m10699, task 7 7 [8444]: mask 0x8000008000 set
19: cpu-bind=MASK - m10699, task 19 19 [8456]: mask 0x200000200000 set
5: cpu-bind=MASK - m10699, task 5 5 [8442]: mask 0x4000004000 set
1: cpu-bind=MASK - m10699, task 1 1 [8438]: mask 0x1000001000 set
18: cpu-bind=MASK - m10699, task 18 18 [8455]: mask 0x200000200 set
3: cpu-bind=MASK - m10699, task 3 3 [8440]: mask 0x2000002000 set
16: cpu-bind=MASK - m10699, task 16 16 [8453]: mask 0x100000100 set
21: cpu-bind=MASK - m10699, task 21 21 [8458]: mask 0x400000400000 set
17: cpu-bind=MASK - m10699, task 17 17 [8454]: mask 0x100000100000 set
9: cpu-bind=MASK - m10699, task 9 9 [8446]: mask 0x10000010000 set
4: cpu-bind=MASK - m10699, task 4 4 [8441]: mask 0x4000004 set
8: cpu-bind=MASK - m10699, task 8 8 [8445]: mask 0x10000010 set
12: cpu-bind=MASK - m10699, task 12 12 [8449]: mask 0x40000040 set
11: cpu-bind=MASK - m10699, task 11 11 [8448]: mask 0x20000020000 set
13: cpu-bind=MASK - m10699, task 13 13 [8450]: mask 0x40000040000 set
2: cpu-bind=MASK - m10699, task 2 2 [8439]: mask 0x2000002 set
10: cpu-bind=MASK - m10699, task 10 10 [8447]: mask 0x20000020 set
0: SETUP OF INT2LM
0: INITIALIZATIONS
0: Info about KIND-parameters: iintegers / MPI_INT = 4
0: 7
0: int_ga / MPI_INT = 4
0: 7
0: INPUT OF THE NAMELISTS
0: *** NOTE: Old 10 digit date format is used for output files of INT2LM
0: *** specifications of input soil main levels ***
0: *** from Namelist INPUT are used ***
0: *** specifications of LM soil main levels ***
0: *** from Namelist INPUT are used ***
0: *** A default set for vcoord parameters is used: 2
0: *** A default set for refatm parameters is used: 2
0:
0: Code information used to build this binary
0: Binary name ....: tstint2lm
0:
0: Library name ......: int2lm
0: Tag name ..........: V2_0
0: Checkin-Date ......: 2013-11-01 14:28:13
0: Code is modified ..: .false.
0: Compile-Date ......:
0: Compiled by .......: uschaett
0:
0: Current start time : 2019-12-19 15:29
0: Running on nodes ..:
0: Data decomposition :
0: End of code information
0:
0: *------------------------------------------------------------*
0: * PROGRAM TERMINATED BECAUSE OF ERRORS DETECTED
0: * IN ROUTINE: int2lm_org
0: *
0: * ERROR CODE is 1002
0: * ERROR *** Wrong values occured in NAMELIST input ***
0: *------------------------------------------------------------*
0: --------------------------------------------------------------------------
0: MPI_ABORT was invoked on rank 0 in communicator MPI_COMM_WORLD
0: with errorcode 1002.
0:
0: NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
0: You may or may not see output from other processes, depending on
0: exactly when Open MPI kills them.
0: --------------------------------------------------------------------------
srun: Job step aborted: Waiting up to 32 seconds for job step to finish.
0: slurmstepd: error: *** STEP 18809355.0 ON m10699 CANCELLED AT 2019-12-19T15:29:05 ***
srun: error: m10699: task 0: Exited with exit code 234
srun: Terminating job step 18809355.0
srun: error: m10699: tasks 1-23: Killed
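
In case it helps anyone reading a log like this: most of the output is CPU binding and MPI shutdown noise, and the part that matters is the short abort block from task 0. Below is a rough Python sketch that pulls the routine, error code and message out of output labelled as `<task>: <text>`. The function name `summarize_abort` and the `SAMPLE` excerpt are made up for illustration and are not part of INT2LM or Slurm. The sketch assumes one log record per line.

```python
import re

# Hypothetical excerpt, modeled on the log above (one record per line).
SAMPLE = """\
0: * PROGRAM TERMINATED BECAUSE OF ERRORS DETECTED
0: * IN ROUTINE: int2lm_org
0: *
0: * ERROR CODE is 1002
0: * ERROR *** Wrong values occured in NAMELIST input ***
srun: error: m10699: task 0: Exited with exit code 234
srun: error: m10699: tasks 1-23: Killed
"""

# A labelled record starts with the task number and a colon, e.g. "0: text".
LABEL = re.compile(r"^(\d+):\s?(.*)$")


def summarize_abort(log_text):
    """Collect routine, error code and message from the abort block."""
    routine = code = message = None
    for line in log_text.splitlines():
        match = LABEL.match(line)
        if not match:
            continue  # srun/slurmstepd lines carry no task label
        # Drop the leading "*" decoration of the abort block.
        text = match.group(2).lstrip("* ").strip()
        if text.startswith("IN ROUTINE"):
            routine = text.split(":", 1)[1].strip()
        elif text.startswith("ERROR CODE is"):
            code = int(text.split()[-1])
        elif text.startswith("ERROR"):
            message = text[len("ERROR"):].strip(" *")
    return {"routine": routine, "code": code, "message": message}


if __name__ == "__main__":
    info = summarize_abort(SAMPLE)
    print(info)
    # Exit statuses are limited to 8 bits, so compare the low byte.
    print("low byte of error code:", info["code"] % 256)
```

The last line is there because the exit code 234 reported by srun is consistent with error code 1002 truncated to 8 bits (1002 mod 256 = 234). So the srun exit code is most likely the same namelist error and not a second problem.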