Unable to recreate substitution mutations in AlphaGenome

Hello,

I am interested in running some test cases for AlphaGenome to get a sense of how it works.

I want to see the change in RNA expression in Lung, when a substitution causes a stop codon to be produced.

To see this, I wanted to recreate a mutation in CFTR namely the substitution of G to T, causing Glycine to change to a stop codon. GGA → UGA

I have taken this data from a credible source:

I used this data: * NC_000007.14:g.117587778G>T
as AlphaGenome deals with the genomic sequence.

This is the code I used:

However, there seems to be no difference in the Reference vs Alternative Plots.

I even cross checked the nucleotide number with the UCSC data.
https://genome.ucsc.edu/cgi-bin/hgTracks?db=hg38&lastVirtModeType=default&lastVirtModeExtraState=&virtModeType=default&virtMode=0&nonVirtPosition=&position=chr7%3A117587763-117587826&hgsid=3314559807_i8e2GbbTivHOglXUsMheuNzmHqRS

I have been struggling with this issue for quite some time now, and would greatly appreciate some feedback.

Is this an issue with AlphaGenome or an issue with my code?

Thank you

Hello,

Thanks for reaching out. The code looks fine, and it looks like the variant input is correct, at least it matches the variant ID provided in the NCBI ClinVar record provided. We cannot comment on the robustness of the assumption that the variant is pathogenic.

We recommend also looking at the variant scores for this variant rather than just the REF/ALT visualisations, as sometimes subtle effects can be hard to observe in the visualisations (see the Tutorial Scoring and Visualising a single variant for how to do this).

While we extensively measured performance of AlphaGenome across a variety of modalities and use-cases (see the preprint), there is no guarantee that AlphaGenome predictions will match expectations from known, or putative biology in every case.

Hello, Thank you very much for your reply. I tried to recreate the code as in the tutorial, just changing the variant for CFTR, but end up getting the same plot as I had gotten above, with no difference in the reference and alternative sequence, as indicated in the image below:

Does this mean that this is a test case that AlphaGenome is not able to recreate? I thought that because it is a simple substitution to generate a stop codon, we will be able to see a difference, however small.

Can I assume that AlphaGenome cannot give me a correct prediction for this test case?