Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Move VEP to Requester Pays #8268

Merged
merged 15 commits into from
Mar 9, 2020
Merged

Conversation

johnc1231
Copy link
Contributor

This PR changes our vep init scripts to pull from the new hail-us-vep google bucket, which is requester pays.

I realized while doing this that my previous PR (#8253) really had nothing to do with supporting this, since that really just sets the configuration for the Google Cloud / Hadoop connector, which is not how we get vep data. I tried to enforce the rules from those command line arguments anyway by rejecting --vep flag if they don't specify they're ok with requester pays and manually checking that hail-us-vep was in their approved bucket list. But if a user was to specify the init scripts using gcloud dataproc and didn't go through hailctl, there'd be no catch to check if they were ok with requester pays. Perhaps there is some way we could set environment variables on the dataproc machines based on the --requester-pays-allow.... flags and use those in the init scripts.

I also expanded make test-dataproc to test with a GRCh38 cluster as well, as we use separate scripts to make them and I'm uncomfortable with only testing one.

@johnc1231
Copy link
Contributor Author

Adding "WIP" tag since I had a test failure from make test-dataproc. Seems like I have a mix of reference genomes somewhere.

@johnc1231
Copy link
Contributor Author

Ah, the mix of reference genomes is coming from me running all the "cluster_tests" for GRCh38 cluster as well. One of the tests hardcodes 37

@johnc1231
Copy link
Contributor Author

Ok, all tests passing now. Removing WIP

@johnc1231 johnc1231 added prio:high and removed WIP labels Mar 9, 2020
Copy link
Contributor

@tpoterba tpoterba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

use forum link

conf.extend_flag('initialization-actions', [deploy_metadata[f'vep-{args.vep}.sh']])
sys.stderr.write("WARNING: Running VEP outside of the US is VERY expensive. Please contact us for support.")
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"Please contact the Hail team on https://discuss.hail.is for support."

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(and in the message above)

@tpoterba tpoterba changed the title Move Vep to Requester Pays Move VEP to Requester Pays Mar 9, 2020
@danking danking merged commit 99d2952 into hail-is:master Mar 9, 2020
danking pushed a commit to danking/hail that referenced this pull request Sep 6, 2023
This appers to have been lost in hail-is#8268. There is no explicit mention of it and it was inconsistently
removed (e.g. loftee in GRCh38 still has the `&`). I assume it was a mistake.
danking added a commit that referenced this pull request Sep 6, 2023
This appers to have been lost in #8268. There is no explicit mention of
it and it was inconsistently removed (e.g. loftee in GRCh38 still has
the `&`). I assume it was a mistake.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants