Seeking Guidance on Relevant AlphaGenome Features for RNU4-2 Variant Interpretation

Meghna · December 11, 2025, 11:58am

Hi all,

We are working on variant interpretation for RNU4-2, a very small spliceosomal snRNA where pathogenic variants cluster within very short structural elements. Our current AlphaGenome-based model uses only five features and fails to correctly classify known ClinVar-pathogenic variants in this gene.

From the scoring script, the features we currently derive are:

center_mask_atac (CenterMaskScorer on OutputType.ATAC)
H3K4me3 (from OutputType.CHIP_HISTONE, histone_mark = “H3K4me3”)
H3K27ac (from OutputType.CHIP_HISTONE, histone_mark = “H3K27ac”)
gene_lfc (GeneMaskLFCScorer on OutputType.RNA_SEQ)
splice (CenterMaskScorer on OutputType.SPLICE_SITES)

For a compact ncRNA like RNU4-2, are these modalities and scorers expected to carry meaningful signal?
And would you recommend additional AlphaGenome OutputTypes or scorer configurations that might better capture ncRNA-specific constraints or highly local structural effects?

Thanks!

Jun_Cheng · December 17, 2025, 2:58pm

Do you have any hypothesis on the mechanisms of the variants? AG can predict expression, splicing, chromatin, it can not predict missense variant or variant affect RNA structure.

Meghna · January 27, 2026, 1:30pm

Our working hypothesis is that RNU4-2 pathogenic variants act primarily through disruption of RNA structure and spliceosome assembly, rather than through classical regulatory mechanisms like transcriptional control.

We fully agree that AlphaGenome does not directly model RNA structural disruption or ncRNA-specific molecular mechanisms, and that this is likely why the current features fail for RNU4-2.

What we are trying to understand is whether any indirect or proxy signals within AlphaGenome (e.g. local chromatin accessibility, promoter-associated histone marks, RNA expression perturbation, or splicing-related outputs in nearby genes) are expected to carry any meaningful signal for such a gene — or whether RNU4-2 should be considered largely out of scope for AlphaGenome-based variant scoring.

If the latter is the case, that would already be a valuable conclusion for us in terms of defining model boundaries and motivating complementary approaches.

Topic		Replies	Views
Can alphagenome be used also for CODING variants affecting splicing? Help & Support	1	748	November 10, 2025
Understanding the AlphaGenome generated Result of ANRIL(lncRNA) variants' impact Help & Support	2	168	January 28, 2026
Feature request: custom transcripts in RNA-seq variant scoring Feedback & Feature Requests	0	59	February 3, 2026
How to use a different GENCODE version (e.g., v49) with score_variant()? Help & Support	1	127	January 28, 2026
I made a easy to use screening method for larger variant dataset Community	0	977	July 22, 2025

Seeking Guidance on Relevant AlphaGenome Features for RNU4-2 Variant Interpretation

Related topics