Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a script that can batch import and explain Lancet data in the November format #428

Merged
merged 6 commits into from
May 7, 2024

Conversation

adamnovak
Copy link
Member

This fixes #427 sort of by making a BED that explains what variant is supposed to be seen.

I ran it like this:

cd ~/workspace/sequenceTubeMap
mkdir -p webhost
cd webhost

~/workspace/sequenceTubeMap/scripts/prepare_lancet_output.sh ~/Downloads/STM_DataShare_Nov07_2023\ 2/ ./lancet_2023-11-07

cd lancet_2023-11-07

python3 -m http.server

Then I could put http://[::]:8000/index.bed into my local tube map as the BED file and browse the data.

But, I think the data provided is not exactly what I really want to look at. It looks liek the called tumor variants are not actually in the graphs. For example, at chr1:38506973-38506974 I am suppsoed to see a Tumor-specific CTGGAATCCAGCAGCCCAGACTTCCACATCATAATTTTCTGGGGCAATGGTTTTCAAACTTCACTGTACG -> C DEL variant. But the graph doesn't show a large deletion. Instead, that deleted sequence occurs 1 base into the leftmost node here, and I can see the softclips in the aligned tumor reads at their left ends, where they would read over the deletion edge that isn't present. Here's a screenshot of the softclips, with the tumor reads in red:

Screenshot 2024-05-03 at 16 56 40

So I now have a way to generate and host tumor-normal Lancet examples, including examples for larger indels, but the graphs coming from Lancet don't really seem to be the right ones.

@adamnovak
Copy link
Member Author

OK, I talked to Rajeeva and he told me that Lancet makes several graphs per variant. I added code to find the most centered one for each variant, and now I can indeed see this variant. Here's that deletion being taken by just the tumor reads:

Screenshot 2024-05-07 at 17 24 37

These are now probably usable examples.

@adamnovak adamnovak merged commit 5b1a3f5 into master May 7, 2024
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Document which variant is meant to go with which Lancet example
1 participant