Smoke test should abort early if the run fails #608

phil-blain · 2021-06-15T15:50:28Z

The smoke test does not exit with failure when the CICE run fails, it simply write the FAIL result to the test_output file:

CICE/configuration/scripts/tests/test_smoke.script

Lines 6 to 7 in 1c6c225

    
           ./cice.run 
        
           set res="$status"

[...]

CICE/configuration/scripts/tests/test_smoke.script

Lines 23 to 29 in 1c6c225

    
           set grade = FAIL 
        
           if ( $res == 0 ) then 
        
             set grade = PASS 
        
           endif 
        
           echo "$grade ${ICE_TESTNAME} run ${ttimeloop} ${tdynamics} ${tcolumn}" >> ${ICE_CASEDIR}/test_output 
        
           echo "$grade ${ICE_TESTNAME} test " >> ${ICE_CASEDIR}/test_output

In contrast, the restart test does exit 99:

CICE/configuration/scripts/tests/test_restart.script

Lines 10 to 22 in 1c6c225

    
           ./cice.run 
        
           set res="$status" 
        
           if ( $res != 0 ) then 
        
             mv -f ${ICE_CASEDIR}/test_output ${ICE_CASEDIR}/test_output.prev 
        
             cat ${ICE_CASEDIR}/test_output.prev | grep -iv "${ICE_TESTNAME} run" >! ${ICE_CASEDIR}/test_output 
        
             mv -f ${ICE_CASEDIR}/test_output ${ICE_CASEDIR}/test_output.prev 
        
             cat ${ICE_CASEDIR}/test_output.prev | grep -iv "${ICE_TESTNAME} test " >! ${ICE_CASEDIR}/test_output 
        
             rm -f ${ICE_CASEDIR}/test_output.prev 
        
             echo "FAIL ${ICE_TESTNAME} run" >> ${ICE_CASEDIR}/test_output 
        
             echo "FAIL ${ICE_TESTNAME} test " >> ${ICE_CASEDIR}/test_output 
        
             exit 99 
        
           endif

This makes more sense in my opinion. If the initial run fails it makes no sense to go on and try to do the baseline generation, baseline comparing and BFB compare step, no ?

This can lead to misleading "missing data" results when in fact it is the data for the current test that is missing (because the run failed) and not the data for the run which we are comparing against.

The text was updated successfully, but these errors were encountered:

apcraig · 2021-06-15T16:01:27Z

Thanks for catching this @phil-blain. What you propose makes sense. I can include this fix in my next PR unless you create a PR first.

phil-blain · 2021-06-15T16:04:21Z

You can go ahead. Might be worth it to check the other tests at the same time. From a quick look the decomp, logbfb and unittest tests have the same behaviour.

- Fix bugs in history/restart frequency associated with new calendar (CICE-Consortium#589) - Define frequency in absolute terms relative to 0000-01-01-00000 and document (CICE-Consortium#589) - Update set_nml.histall to include hourly output (CICE-Consortium#589) - Update test scripts to cleanly abort if run fails where possible (CICE-Consortium#608) - Update decomp test so it's rerunable, remove restart at start of run (CICE-Consortium#601) - Add ability to do bfbcomp tests with additional options set on command line (CICE-Consortium#569) - Update documentation of calendar frequency computation, calendar types, and closed boundaries (CICE-Consortium#541) - Add optional doabort flag to abort_ice to control whether the method aborts. This is useful for testing and code coverage statistics, although doabort=.false. will not call the actual abort method, but we can test the interfaces and rest of the code.

* Fix history/restart frequency bugs and update scripts - Fix bugs in history/restart frequency associated with new calendar (#589) - Update set_nml.histall to include hourly output (#589) - Update test scripts to cleanly abort if run fails where possible (#608) - Update decomp test so it's rerunable, remove restart at start of run (#601) - Add ability to do bfbcomp tests with additional options set on command line (#569) - Update documentation of calendar frequency computation, calendar types, and closed boundaries (#541) - Add optional doabort flag to abort_ice to control whether the method aborts. This is useful for testing and code coverage statistics, although doabort=.false. will not call the actual abort method, but we can test the interfaces and rest of the code. - Add histfreq_base and dumpfreq_base ('init' or 'zero') to specify reference data for history and restart output. Defaults are 'zero' and 'init' respectively for hist and dump. Setting histfreq_base to 'zero' allows for consistent output across multiple runs. Setting dumpfreq_base to 'init' allows the standard testing which requires restarts be written, for example, 5 days after the start of the run. - Remove extra abort calls in bcstchk and sumchk on runs that complete fine but don't pass checks. These aborts should never have been there. - Update documentation. - Clean up some of the unit tests to better support regression testing - modify initial/restart implementation - restart namelist is deprecated, now computed internally - modify initial/continue init checks and set restart and use_restart_time as needed - create compute_relative_elapsed method in ice_calendar to improve code reuse - update documentation with regard to initial/continue modes - Set default use_restart_time to false

phil-blain added Testing Scripts labels Jun 15, 2021

apcraig self-assigned this Jun 19, 2021

apcraig mentioned this issue Jun 19, 2021

Fix history and restart frequency, new features to scripts #610

Merged

16 tasks

apcraig closed this as completed in #610 Jul 2, 2021

phil-blain mentioned this issue May 12, 2022

Investigate decomp_suite failures with dynpicard option phil-blain/CICE#39

Open

phil-blain mentioned this issue Aug 10, 2022

Reusing test suite - compare with baseline passes even if model does not run correctly phil-blain/CICE#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Smoke test should abort early if the run fails #608

Smoke test should abort early if the run fails #608

phil-blain commented Jun 15, 2021

apcraig commented Jun 15, 2021

phil-blain commented Jun 15, 2021 •

edited

Loading

Smoke test should abort early if the run fails #608

Smoke test should abort early if the run fails #608

Comments

phil-blain commented Jun 15, 2021

apcraig commented Jun 15, 2021

phil-blain commented Jun 15, 2021 • edited Loading

phil-blain commented Jun 15, 2021 •

edited

Loading