Verbose config check #483

avishekrk · 2018-10-29T21:55:24Z

This PR adds:

verbose output for the config validator to better check errors
outputs the timechop figure to a png

thcrock · 2018-10-30T19:57:19Z

src/triage/experiments/validate.py

@@ -328,16 +328,19 @@ def _validate_imputations(self, aggregation_config):
        agg_types = ["aggregates", "categoricals", "array_categoricals"]

        for agg_type in agg_types:
+            logging.info('agg_type:{}'.format(agg_type))


Use logging variables instead of formatted strings:
logging.info('agg_type: %s', agg_type)

This has two advantanges:

logging module doesn't do the interpolation unless the message is actually visible (minor win)

and enables log aggregators to more intelligently aggregate logfiles. For instance, if a message gets logged multiple times with different arguments, log aggregators are smart enough to aggregate them together but this is not possible with formatted strings.

And the intros should be a bit more descriptive. 'Checking imputation rules for aggregation type %s', 'Checking imputation rules for aggregation %s', 'Checking imputation rules for metric %s'

thcrock · 2018-10-30T21:51:23Z

src/triage/component/timechop/plotting.py

@@ -110,4 +112,5 @@ def visualize_chops(chopper, show_as_of_times=True, show_boundaries=True):
    ax[0].set_title("Timechop: Temporal cross-validation blocks")
    fig.subplots_adjust(hspace=0)
    plt.setp([a.get_xticklabels() for a in fig.axes[:-1]], visible=False)
+    plt.savefig('timechop.png')


Isn't this going to break (and never call show) if whatever is running doesn't have write access to whatever directory this ends up being? Furthermore, if this breaks in a notebook setting (when the user is just trying to plot it to the screen), it will break their simple displaying use case for the sake of filesystem saving functionality that they don't even care about.

I think there's a lot that we can do to make this more robust (integrate it with the ProjectStorage via the CLI, etc) but the bare minimum for this pull request I think should be to just plot to an optional target instead of hardcoding a filesystem path. We do this in the audition plotting module for an example.

https://github.com/dssg/triage/blob/master/src/triage/component/audition/plotting.py#L217-L218

Just have the save target be an optional argument, and conditionally save it like this:

if path_to_save: plt.savefig(path_to_save)

And we can make the implementation more complete in a future PR

thcrock and others added 2 commits October 23, 2018 11:30

Testing session close

5c649d1

added verbose config checks solves #459

7eebd50

avishekrk requested a review from thcrock October 29, 2018 21:55

thcrock reviewed Oct 30, 2018

View reviewed changes

thcrock mentioned this pull request Nov 20, 2018

showtimechops CLI option not working #512

Closed

Logging fixes and make save optional

8358ce4

thcrock self-assigned this Dec 4, 2018

thcrock merged commit fb42275 into master Dec 4, 2018

thcrock deleted the verbose_config_check branch December 4, 2018 20:36

thcrock mentioned this pull request Dec 7, 2018

Validator should be more verbose about imputation rules #459

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Verbose config check #483

Verbose config check #483

avishekrk commented Oct 29, 2018

thcrock Oct 30, 2018

thcrock Oct 30, 2018 •

edited

Loading

Verbose config check #483

Verbose config check #483

Conversation

avishekrk commented Oct 29, 2018

thcrock Oct 30, 2018

Choose a reason for hiding this comment

thcrock Oct 30, 2018 • edited Loading

Choose a reason for hiding this comment

thcrock Oct 30, 2018 •

edited

Loading