Fetch (optional) root-sequence JSON #1197

jameshadfield · 2020-08-07T06:47:30Z

Auspice has had a long-standing issue where choosing a genotype for a position where no mutations were observed resulted in an uninformative coloring of the tree. This is because we don't store the ancestral (root) sequence in the dataset JSON and thus rely on mutations to infer it.

Upon dataset load we now make a request for the "root-sequence" sidecar file. If this request is successful we use the data to color genotypes for which there are no mutations. The get-dataset script (which runs for auspice heroku deployments) has been modified to fetch vic & yam from the staging server, as corresponding root-sequence JSONs were present there but not on nextstrain-data.

You can see this in action via https://auspice-root-seq-doeexpnizvfdt.herokuapp.com/flu/seasonal/vic/ha/3y?c=gt-HA2_120 -- notice how the tree starts off with a grey coloring and then once the root-sequence JSON request arrives the coloring updates to show the correct AA (T). Contrast this with https://auspice-root-seq-doeexpnizvfdt.herokuapp.com/flu/seasonal/h3n2/ha/3y?c=gt-HA2_120 for which the root-sequence JSON isn't available.

@emmahodcroft do you have a root-sequence JSON for a TB dataset? Presumably that will be a large JSON and it'd be good to use that for testing to see if there are any issues.

This modifies the default workflow to produce the sidecar root-sequence JSON for each build. This is in preparation for nextstrain/auspice#1197 (see there for an explanation of the advantages this JSON gives us)

rneher · 2020-09-29T19:55:09Z

this would be quite useful also for nextclade. we could then fetch the root sequence along with the tree and make it work for any of our builds.

Currently genotypes are unknown for positions without any mutations, as it is through mutations we infer the appropriate values to display. This commit changes the coloring to be grey in such cases, rather than blue/green.

Upon dataset load we now make a request for the "root-sequence" sidecar file. If this request is successful we use the data to color genotypes for which there are no mutations (previously the nt/aa at such positions weren't known)

This switches to getting vic & yam from the nextstrain staging bucket as the root-sequence JSONs weren't present on the data bucket

jameshadfield · 2020-10-07T02:00:11Z

Merging after rebasing onto master & retesting

This modifies the default workflow to produce the sidecar root-sequence JSON for each build. See nextstrain/auspice#1197 for the functionality this file gives Auspice.

jameshadfield temporarily deployed to auspice-root-seq-doeexpnizvfdt August 7, 2020 06:47 Inactive

jameshadfield force-pushed the root-seq branch from 583e6a7 to 5f624c1 Compare August 7, 2020 06:54

jameshadfield temporarily deployed to auspice-root-seq-doeexpnizvfdt August 7, 2020 06:54 Inactive

jameshadfield mentioned this pull request Aug 7, 2020

produce root-sequence sidecar JSON nextstrain/ncov#479

Merged

jameshadfield added 3 commits October 7, 2020 14:27

Change coloring for unknown genotypes to grey

66b2167

Currently genotypes are unknown for positions without any mutations, as it is through mutations we infer the appropriate values to display. This commit changes the coloring to be grey in such cases, rather than blue/green.

Update get-data script to fetch root-sequence JSONs

616d119

This switches to getting vic & yam from the nextstrain staging bucket as the root-sequence JSONs weren't present on the data bucket

jameshadfield force-pushed the root-seq branch from 5f624c1 to 616d119 Compare October 7, 2020 01:28

jameshadfield temporarily deployed to auspice-root-seq-oafyzuc8kjwa3 October 7, 2020 01:37 Inactive

jameshadfield merged commit 29dca85 into master Oct 7, 2020

jameshadfield deleted the root-seq branch October 7, 2020 02:00

jameshadfield mentioned this pull request Dec 10, 2020

Sequence for non-variable sites #616

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fetch (optional) root-sequence JSON #1197

Fetch (optional) root-sequence JSON #1197

jameshadfield commented Aug 7, 2020 •

edited

Loading

rneher commented Sep 29, 2020

jameshadfield commented Oct 7, 2020

Fetch (optional) root-sequence JSON #1197

Fetch (optional) root-sequence JSON #1197

Conversation

jameshadfield commented Aug 7, 2020 • edited Loading

rneher commented Sep 29, 2020

jameshadfield commented Oct 7, 2020

jameshadfield commented Aug 7, 2020 •

edited

Loading