Add subset evaluations support in evaluations module #535

Closed
ecsalomon opened this issue Dec 3, 2018 · 1 comment
@ecsalomon

No description provided.

@ecsalomon ecsalomon self-assigned this Dec 3, 2018
@ecsalomon

First step in resolving #519

ecsalomon added a commit that referenced this issue Dec 10, 2018
This commit adds support for evaluating models against subsets of their
predictions, in both training and testing. It adds three tables to the
results schemas to track subsets and their evaluations:

  - `model_metadata.subsets` stores subset metadata, including a hash,
    the subset configuration, and the time the row was created
  - `train_results.subset_evaluations` and
    `test_results.subset_evaluations` store evaluations for each subset

A new alembic upgrade script creates the subsets tables.

Testing factories are included for the subsets and subset_evaluations
tables, and a test for the factories ensures that the foreign keys in
the subset_evaluations tables are correctly configured.

Most of the remaining code changes are made to the ModelEvaluator class,
which can now process subset queries, write the results to the
appropriate table [#535], and record `NULL` values for undefined
metrics (whether due to an empty subset or a lack of variation in
labels [#138]).

However, some changes are made elsewhere in the experiment to allow
optionally including subsets in the experiment configuration file;
these include storing subset metadata in the `model_metadata.subsets`
table and iterating over subsets in the model tester.

In addition, some changes to the documentation and `.gitignore` are
included to make modifying the results schema more joyful.
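
For orientation, here is a minimal sketch of how the tables described
above might be declared with SQLAlchemy. It is not the actual Triage
model code: columns beyond the hash, the configuration, and the created
time mentioned in the commit message (`model_id`, `metric`, `parameter`,
`value`) are illustrative assumptions, and `train_results.subset_evaluations`
would mirror the test-side table shown here.

```python
# Illustrative sketch only: column names other than subset_hash,
# config, and created_timestamp are assumptions, not Triage's schema.
from sqlalchemy import (
    Column, DateTime, ForeignKey, Integer, Numeric, String, Text, func
)
from sqlalchemy.dialects.postgresql import JSONB
from sqlalchemy.ext.declarative import declarative_base

Base = declarative_base()


class Subset(Base):
    """model_metadata.subsets: one row per distinct subset configuration."""
    __tablename__ = "subsets"
    __table_args__ = {"schema": "model_metadata"}

    subset_hash = Column(String, primary_key=True)  # hash of the config
    config = Column(JSONB)                          # the subset configuration
    created_timestamp = Column(
        DateTime(timezone=True), server_default=func.now()
    )


class TestSubsetEvaluation(Base):
    """test_results.subset_evaluations: one metric value per model/subset."""
    __tablename__ = "subset_evaluations"
    __table_args__ = {"schema": "test_results"}

    subset_hash = Column(
        String,
        ForeignKey("model_metadata.subsets.subset_hash"),
        primary_key=True,
    )
    model_id = Column(Integer, primary_key=True)
    metric = Column(Text, primary_key=True)
    parameter = Column(Text, primary_key=True)
    value = Column(Numeric)  # NULL when the metric is undefined
```
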
ecsalomon added a commit that referenced this issue Dec 10, 2018
ecsalomon added a commit that referenced this issue Jan 19, 2019
ecsalomon added a commit that referenced this issue Feb 19, 2019
This commit adds support for evaluating models against subsets of their
predictions, in both training and testing. It adds a table to the
results schemas to track subsets:

  - `model_metadata.subsets` stores subset metadata, including a hash,
    the subset configuration, and the time the row was created

The `evaluations` tables in the `train_results` and `test_results`
schemas are updated to include a new column, `subset_hash` (also added
to the primary key), which is an empty string for full-cohort
evaluations or contains the subset hash when the evaluation is for a
subset of the cohort.

A new alembic upgrade script creates the subsets table and updates the
evaluation tables.

Testing factories are included or modified for the subsets and
evaluation tables.

Most of the remaining code changes are made to the ModelEvaluator class,
which can now process subset queries, write the results to the
appropriate table [#535], and record `NULL` values for undefined
metrics (whether due to an empty subset or a lack of variation in
labels [#138]).

However, some changes are made elsewhere in the experiment to allow
optionally including subsets in the experiment configuration file;
these include storing subset metadata in the `model_metadata.subsets`
table and iterating over subsets in the model tester.

In addition, some changes to the documentation and `.gitignore` are
included to make modifying the results schema more joyful.
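
As a hedged sketch only, the alembic upgrade described here could look
roughly like the following for the test-side table; the constraint name
and the exact primary-key column list are assumptions for illustration,
not the migration actually shipped with Triage.

```python
# Sketch of the migration, not the script shipped with Triage.
from alembic import op
import sqlalchemy as sa


def upgrade():
    # New lookup table for subset metadata.
    op.create_table(
        "subsets",
        sa.Column("subset_hash", sa.String, primary_key=True),
        sa.Column("config", sa.JSON),
        sa.Column(
            "created_timestamp",
            sa.DateTime(timezone=True),
            server_default=sa.func.now(),
        ),
        schema="model_metadata",
    )

    # Add subset_hash to the evaluations table; the empty-string default
    # marks a full-cohort evaluation, so existing rows remain valid.
    op.add_column(
        "evaluations",
        sa.Column("subset_hash", sa.String, nullable=False, server_default=""),
        schema="test_results",
    )

    # Fold the new column into the primary key (constraint and column
    # names assumed for illustration).
    op.drop_constraint("evaluations_pkey", "evaluations", schema="test_results")
    op.create_primary_key(
        "evaluations_pkey",
        "evaluations",
        [
            "model_id",
            "evaluation_start_time",
            "evaluation_end_time",
            "metric",
            "parameter",
            "subset_hash",
        ],
        schema="test_results",
    )
```
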
ecsalomon added a commit that referenced this issue Feb 20, 2019
This commit adds support for evaluating models against subsets of their
predictions, in both training and testing. It adds a table to the
results schemas to track subsets:

  - `model_metadata.subsets` stores subset metadata, including a hash,
    the subset configuration, and the time the row was created

The `evaluations` tables in the `train_results` and `test_results`
schemas are updated to include a new column, `subset_hash` (also added
to the primary key), which is an empty string for full-cohort
evaluations or contains the subset hash when the evaluation is for a
subset of the cohort.

A new alembic upgrade script creates the subsets table and updates the
evaluation tables.

Testing factories are included or modified for the subsets and
evaluation tables.

Most of the remaining code changes are made to the ModelEvaluator class,
which can now process subset queries, write the results to the
appropriate table [#535], and record `NULL` values for undefined
metrics (whether due to an empty subset or a lack of variation in
labels [#138]).

WIP: Preparation for a more subsets-like experience, in which a subset
table is built up front from the user-supplied query and then used at
evaluation time. The first step is renaming the cohort generators to
entity_date table generators, since that code will serve a more generic
function.

However, some changes are made elsewhere in the experiment to allow
optionally including subsets in the experiment configuration file;
these include storing subset metadata in the `model_metadata.subsets`
table and iterating over subsets in the model tester.

In addition, some changes to the documentation and `.gitignore` are
included to make modifying the results schema more joyful.
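
To make that flow concrete, here is a rough sketch, not Triage's actual
API, of how an optional subsets section of the experiment configuration
might be hashed, stored, and iterated over at evaluation time. The
config keys, the example query, and the `evaluator.evaluate` signature
are all invented for illustration.

```python
import hashlib
import json

# Hypothetical shape of an optional subsets section of the experiment
# config; keys and the example query are invented for illustration.
subsets = [
    {
        "name": "women",
        "query": (
            "SELECT entity_id FROM demographics "
            "WHERE gender = 'woman' AND demographic_date < '{as_of_date}'"
        ),
    },
]


def subset_hash(subset_config):
    """Deterministic hash of a subset configuration, suitable for storing
    in model_metadata.subsets and stamping on each subset evaluation."""
    return hashlib.md5(
        json.dumps(subset_config, sort_keys=True).encode("utf-8")
    ).hexdigest()


def evaluate_model(evaluator, model_id, matrix):
    # Full-cohort evaluation keeps the empty-string sentinel.
    evaluator.evaluate(matrix, model_id, subset_hash="")
    # One extra evaluation pass per configured subset, restricting
    # predictions to the entities returned by the subset query.
    for subset in subsets:
        evaluator.evaluate(matrix, model_id, subset_hash=subset_hash(subset))
```
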
ecsalomon added a commit that referenced this issue Feb 22, 2019
ecsalomon added a commit that referenced this issue Feb 22, 2019
ecsalomon added a commit that referenced this issue Feb 27, 2019