What is our notion of best-fit for generation, prediction, and relaxation? #12

sgbaird · 2022-05-21T21:38:50Z

EDIT: see also issues with the "notion of best" label

Relaxation is probably the most straightforward: use some crystal distance. Prediction can be checked against known allotropes, taking the lowest crystal distance among the allotropes. Generation is the least straightforward. Perhaps a Pareto hypervolume metric via a fictitious adaptive design campaign (e.g. bulk modulus vs. energy above hull)? Perform hyperparameter optimization, then do DFT as the final validation.
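To make the Pareto hypervolume idea concrete, here is a minimal sketch for the two-objective case mentioned above (maximize bulk modulus, minimize energy above hull). The function name `hypervolume_2d`, the reference point, and the candidate values are all made up for illustration; nothing here comes from an existing package, and a real campaign would likely use an established multi-objective library.

```python
def hypervolume_2d(candidates, reference):
    """Area dominated by `candidates` and bounded by `reference`.

    candidates: iterable of (bulk_modulus, e_above_hull) pairs.
    reference: (bulk_modulus, e_above_hull) pair chosen to be worse than
        any candidate of interest (low modulus, high energy above hull).
    """
    ref_k, ref_e = reference
    # Negate bulk modulus so that both objectives are minimized.
    r1, r2 = -ref_k, ref_e
    # Keep only points that are strictly better than the reference.
    points = sorted((-k, e) for k, e in candidates if -k < r1 and e < r2)

    area = 0.0
    best_e = r2  # lowest energy above hull seen so far in the sweep
    for f1, f2 in points:
        if f2 < best_e:  # not dominated by any point already visited
            area += (r1 - f1) * (best_e - f2)
            best_e = f2
    return area


# Hypothetical candidates: (bulk modulus in GPa, energy above hull in eV/atom)
candidates = [(100.0, 0.00), (200.0, 0.05), (300.0, 0.10), (150.0, 0.08)]
hv = hypervolume_2d(candidates, reference=(0.0, 0.20))
print(hv)
```

A larger hypervolume means the generated candidates push the trade-off front further from the reference point. The choice of reference point is itself a hyperparameter of such a metric.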

sgbaird · 2022-05-28T19:17:59Z

Another option that struck me is using a time-split. For example:

1. Split Materials Project into two pieces based on a datetime split.
2. Unconditionally generate many crystal structures.
   - 10+ million? Maybe check convergence with respect to the number of generated structures (a hyperparameter of the metric).
3. Compute the ratio of close matches with the latter half of Materials Project entries to the total number of entries in the latter half; a higher fraction means better performance.
   - The match tolerance(s) will be additional hyperparameter(s) of the metric.
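The time-split idea can be sketched roughly as follows. This is a toy illustration under loud assumptions: each entry is a dict with a hypothetical `date` and a made-up two-number `fingerprint`, the "crystal distance" is plain Euclidean distance between fingerprints, and a match is counted once per held-out entry. In practice the comparison would be done with a proper structure matcher (for example pymatgen's `StructureMatcher`), and the helper names `time_split` and `match_fraction` are invented here.

```python
from datetime import date
from math import dist


def time_split(entries, cutoff):
    """Split entries into those dated before the cutoff and those on/after it."""
    earlier = [e for e in entries if e["date"] < cutoff]
    later = [e for e in entries if e["date"] >= cutoff]
    return earlier, later


def match_fraction(generated, held_out, tol):
    """Fraction of held-out entries with at least one generated structure within `tol`."""
    if not held_out:
        return 0.0
    matched = sum(
        1
        for h in held_out
        if any(dist(g, h["fingerprint"]) <= tol for g in generated)
    )
    return matched / len(held_out)


# Toy stand-in for Materials Project entries (all values are fabricated placeholders).
entries = [
    {"date": date(2015, 1, 1), "fingerprint": (0.0, 0.0)},
    {"date": date(2019, 6, 1), "fingerprint": (1.0, 1.0)},
    {"date": date(2020, 3, 1), "fingerprint": (5.0, 5.0)},
]
train, test = time_split(entries, cutoff=date(2018, 1, 1))

# Stand-in for fingerprints of unconditionally generated structures.
generated = [(1.05, 1.0), (9.0, 9.0)]
fraction = match_fraction(generated, test, tol=0.1)
print(fraction)
```

Both the tolerance `tol` and the number of generated structures act as hyperparameters of the metric, so it would make sense to report the fraction at several settings rather than a single value.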

sgbaird · 2022-06-01T01:55:48Z

We can also look at model accuracy on Matbench task(s) as a way to probe the "quality" of the xtal2png representation from another perspective (see #50).

sgbaird · 2022-06-01T19:53:25Z

DFT simulations will also be important as a high-cost validation.

sgbaird · 2022-06-10T15:51:49Z

From mp-time-split:

... MPTS-52 can be used with the metrics introduced in CDVAE's compute_metrics.py script (see txie-93/cdvae#10). ...

sgbaird · 2022-06-12T05:47:58Z

Having trouble getting CDVAE to run (txie-93/cdvae#19), but can probably splice out compute_metrics.py while that's getting sorted out.

sgbaird · 2022-06-16T04:14:56Z

compute_metrics.py seems to be tightly integrated with the rest of the codebase. Simplest solution might just be to fork CDVAE, make it pip- and conda-installable, and then include it as a dependency for matbench-genmetrics.

sgbaird · 2022-06-24T02:35:06Z

Might hold off on CDVAE metrics for now. See txie-93/cdvae#10

sgbaird · 2022-08-20T07:16:42Z

As an update, matbench-genmetrics now runs in a reasonable time: https://github.com/sparks-baird/matbench-genmetrics/blob/main/notebooks/1.0-matbench-genmetrics-basic.ipynb

sgbaird mentioned this issue May 21, 2022

Consider having certain feature ranges as tunable parameters #13

Closed

sgbaird mentioned this issue Jun 1, 2022

Create time-splits for Materials Project for percent matched notion of best #54

Closed

sgbaird added the notion-of-best Notions of best fit, i.e. how to characterize quality of generated structures. label Jun 1, 2022

This was referenced Jun 1, 2022

Check whether generated structures are affine matches of structures in the training data #10

Closed

Materials Project time split dataset - load_data_from_json returns None during debugging (conditionally) hackingmaterials/matminer#832

Open

sgbaird self-assigned this Jun 11, 2022

sgbaird mentioned this issue Jun 14, 2022

Create Colab notebook with denoising_diffusion_pytorch example #57

Closed

sgbaird mentioned this issue Jun 22, 2022

How to calculate RMSE for a predicted a structure and a true structure? Tomoki-YAMASHITA/CrySPY#5

Closed

sgbaird pinned this issue Jul 8, 2022

sgbaird mentioned this issue Aug 20, 2022

use xtal2png with imagen-pytorch and matbench-genmetrics #204

Open
